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COMPOSITIONS AND METHODS FOR THE TREATMENT OF IMMUNE RELATED DISEASES 

5 Field of tlie Invention 

The present invention relates to compositions and methods useful for the diagnosis and treatment of 
immune related diseases. 

Background of the Invention 

10 Immune related and inflammatory diseases are the manifestation or consequence of fairly complex, 

often multiple interconnected biological pathways which in normal physiology are critical to respond to 
insult or injury, initiate repair from insult or injury, and mount innate and acquired defense against foreign 
organisms. Disease or pathology occurs when these normal physiological pathways cause additional insult 
or injury either as directly related to the intensity of the response, as a consequence of abnormal regulation 

15 or excessive stimulation, as a reaction to self, or as a combination of these. 

Though the genesis of these diseases often involves multistep pathways and often multiple different 
biological systems/pathways, intervention at critical points in one or more of these pathways can have an 
ameliorative or therapeutic effect Therapeutic intervention can occur by eitlier antagonism of a detrimental 
process/pathway or stimulation of a beneficial process/pathway. 

20 Many immune related diseases are known and have been extensively studied. Such diseases 

include immune-mediated inflammatory diseases, non-immune-mediated inflammatory diseases, infectious 
diseases, inmiunodeficiency diseases, neoplasia, etc. 

Immune related diseases could be treated b^^ suppressing the immune response. Using neutralizing 
antibodies that inliibit molecules having inunune stimulatory activity would be beneficial in the treatment of 

25 immune-mediated and inflammatory diseases. Molecules which inhibit the immune response can be utilized 
(proteins directly or via the use of antibody agonists) to inhibit the inamune response and thus ameliorate 
inimune related disease. 

Macrophages represent an ubiquitously distributed population of jBxed and circulating mononuclear 
phagocytes that express a variety of fiinctions including cytokine production, killing of microbes and tumor 

30 cells and processing and presentation of antigens. Macrophages originate in the bone marrow ftom stem 
cells that give rise to a bipotent granulocyte/macrophage cell population. Distinct granulocyte and 
macrophage colony forming cell lineages arise from GM-CSF under the influence of specific cytokines. 
Upon division, monoblasts give rise to promonocytes and monocytes in the bone marrow. From there, 
monocytes enter the circulation. In response to particular stimuli (e.g. infection or foreign bodies) 

35 monoc3^es migrate into tissues and organs where they differentiate into macrophages. 

Macrophages in various tissues vary in their morphology and function and have been assigned 
different names, e.g. Kupffer cells in the liver, pulmonary and alveolar macrophages in the lung and 
microglial cells in the central nervous system. However, the relationship between blood monocytes and 
tissue macrophages remains unclear. 
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In the present study monocytes were differentiated into macrophages by adherence to plastic in the 
presence of a combination of human and bovine serum. After 7 days in culture, monocytes-derived 
macrophages display features typical of differentiated tissue macrophages including their ability to 
phagocytose opsonized particles, secretion of TNF-alpha upon lipopolysaccharide (LPS) stimulation, 
5 formation of processes and the presence of macrophage cell surface markers. 

Using microarray technologies, gene transcripts from non-differentiated monocytes harvested 
before adhering were compared with those at 1 day and 7 days in culture. Genes selectively expressed in 
monocytes or macrophages could be used for the diagnosis and treatment of various chronic inflammatory or 
autoimmune diseases in the hmnan. In particular, surface expressed molecules or transmembrane receptors 
10 involved in monocyte/macrophage adhesion and endothelial cell transmigration could provide novel targets 
to treat chronic inflammation by interference with the homing of these cells to the site of inflammation. In 
addition, transmembrane inhibitory receptors could be used to down-regulate monocyte/macrophage effector 
functions. Therapeutic molecules can be antibodies, peptides, fusion proteins or small molecules. 

Despite the above research in monocyte/macrophages, there is a great need for additional 
15 diagnostic and therapeutic agents capable of detecting the presence of monocyte/macrophage mediated 
disorders in a mammal and for effectively reducing these disorders. Accordingly, it is an objective of the 
present invention to identify polypeptides that are differentially expressed in macrophages as compared to 
non-differentiated monocytes, and to use those polypeptides, and their encoding nucleic acids, to produce 
compositions of matter useful in the therapeutic treatment and diagnostic detection of 
20 monoc3^e/macrophage mediated disorders in mammals. 

Summarv of the Invention 

A. Embodiments 

The present invention concerns compositions and methods useful for the diagnosis and treatment of 
25 immune related disease in mammals, including humans. The present invention is based on the identification 
of proteins (including agonist and antagonist antibodies) which are a result of stimulation of the immune 
response in mammals. Immime related diseases can be treated by suppressing or enhancing the immune 
response. Molecules tliat enhance the immune response stimulate or potentiate the immune response to an 
antigen. Molecules which stimulate the immune response can be used therapeutically where enhancement of 
30 the immune response would be beneficial. Alternatively, molecules that suppress the immune response 
attenuate or reduce the inornxme response to an antigen (e.g,^ neutralizing antibodies) can be used 
therapeutically where attenuation of the immune response would be beneficial (e.g., inflammation). 
Accordingly, the PRO pol3^eptides, agonists and antagonists thereof are also useful to prepare medicines 
and medicaments for the treatment of immune-related and inflammatory diseases. In a specific aspect, such 
35 medicines and medicaments comprise a therapeutically effective amount of a PRO polypeptide, agonist or 
antagonist thereof with a pharmaceutically acceptable carrier. Preferably, the admixture is sterile. 

In a further embodiment, the invention concerns a method of identifying agonists or antagonists to a 
PRO polypeptide which comprises contacting the PRO polypeptide with a candidate molecule and 
monitoring a biological activity mediated by said PRO polypeptide. Preferably, the PRO polypeptide is a 
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native sequence PRO polypeptide. In a specifLc aspect, the PRO agonist or antagonist is aa anti-PRO 
antibody. 

In another embodiment, tlae invention concerns a composition of matter comprising a PRO 
polypeptide or an agonist or antagonist antibody which binds the polypeptide in admixture with a carrier or 
5 excipient. In one aspect, the composition comprises a therapeutically effective amount of the polypeptide or 
antibody. In another aspect, when the composition comprises an immune stimulating molecule, the 
composition is useful for: (a) iucreasing infiltration of inflammatory cells into a tissue of a mammal in need 
thereof, (b) stimulating or enhancing an immune response in a mammal in need thereof, (c) increasing the 
proliferation of monocj^es/macrophages in a mammal iii need thereof in response to an antigen, (d) 

10 stimulating the activity of monocytes/macrophages or (e) increasing the vascular permeability. In a further 
aspect, when the composition comprises an immune inhibiting molecule, the composition is useful for: (a) 
decreasing infiltration of inflammatory cells into a tissue of a mammal in need tliereof, (b) inhibiting or 
reducing an immune response in a marmnal in need thereof, (c) decreasing the activity of 
monocytes/macrophages or (d) decreasing the proliferation of monoc3^es/macrophages in a mammal in need 

15 thereof in response to an antigen. In another aspect, the composition comprises a further active ingredient, 
which may, for example, be a further antibody or a cytotoxic or chemotherapeutic agent. Preferably, the 
composition is sterile. 

In another embodiment, the invention concerns a method of treating an immime related disorder in 
a mammal in need thereof, comprising administering to the mammal an effective amount of a PRO 

20 polypeptide, an agonist thereoJ^ or an antagonist thereto. In a preferred aspect, the immune related disorder 
is selected from the group consisting of: systemic lupus erytiiematosis, rheumatoid arthritis, osteoarthritis, 
juvenile chronic arthritis, spondyloarthropathies, systemic sclerosis, idiopathic inflanmiatory myopathies, 
Sjogren's syndrome, systemic vasculitis, sarcoidosis, autoimmune hemolytic anemia, autoinmaune 
thrombocytopenia, thyroiditis, diabetes mellitus, immune-mediated renal disease, demyelinating diseases of 

25 the central and peripheral nervous systems such as multiple sclerosis, idiopathic demyeliaating 
polyneuropalliy or GuiUain-Barre syndrome, and chronic inflammatory demyelinating polyneuropathy, 
hepatobiliary diseases such as infectious, autoimmune chronic active hepatitis, primary biliary cirrhosis, 
granulomatous hepatitis, and sclerosing cholangitis, inflammatory bowel disease, gluten-sensitive 
enteropathy, and Whipple's disease, autoimmime or immime-mediated skin diseases including bullous skin 

30 diseases, erythema multiforme and contact dermatitis, psoriasis, allergic diseases such as asthma, allergic 
rliinitis, atopic dermatitis, food hypersensitivity and urticaria, immunologic diseases of the lung such as 
eosinophilic pneumonias, idiopathic pulmonary fibrosis and hypersensitivity pneumonitis, transplantation 
associated diseases including graft rejection and graft -versus-host-disease. 

In another embodiment, the invention provides an antibody which specifically binds to any of the 

35 above or below described polypeptides. Optionally, the antibody is a monoclonal antibody, humanized 
antibody, antibody fragment or single-chain antibody. In one aspect, the present invention concerns an 
isolated antibody which binds a PRO polypeptide. In another aspect, the antibody mimics thie activity of a 
PRO polypeptide (an agonist antibody) or conversely the antibody inhibits or neutralizes the activity of a 
PRO polypeptide (an antagonist antibody). In another aspect, the antibody is a monoclonal antibody, which 

40 preferably has nonhuman complementarity determining region (CDR) residues and human framework region 
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(FR) residues. The antibody may be labeled and may be immobilized on a solid support. In a further aspect, 
the antibody is an antibody fragment, a monoclonal antibody, a single-chain antibody, or an anti-idiotypic 
antibody. 

In yet another embodiment, the present invention provides a composition comprising an anti-PRO 
5 antibody in admixture with a pharmaceutically acceptable carrier. In one aspect, the composition comprises 
a therapeutically effective amount of the antibody. Preferably, the composition is sterile. The composition 
may be administered in the form of a liquid pharmaceutical formulation, which may be preserved to achieve 
extended storage stability. Alternatively, the antibody is a monoclonal antibody, an antibody fragment, a 
humanized antibody, or a single-chain antibody. 
10 In a further embodiment, the invention concerns an article of manufacture, comprising: 

(a) a composition of matter comprising a PRO polypeptide or agonist or antagonist thereof; 

(b) a container containing said composition; and 

(c) a label affixed to said container, or a package insert included in said container referring to 
the use of said PRO polypeptide or agonist or antagonist thereof in the treatment of an immune related 

15 disease. The composition may comprise a therapeutically effective amount of the PRO polypeptide or the 
agonist or antagonist thereof. 

In yet another embodiment, the present invention concerns a method of diagnosing an immune 
related disease in a mammal, comprising detecting the level of expression of a gene encoding a PRO 
polypeptide (a) in a test sample of tissue cells obtained from the mammal, and (b) in a control sample of 

20 known normal tissue ceUs of the same cell type, wherein a higher or lower expression level in the test 
sample as compared to the control sample indicates the presence of immune related disease in the mamanal 
from which the test tissue cells were obtained. 

In another embodiment, the present invention concerns a method of diagnosing an immune disease 
in a mammal, comprising (a) contacting an anti-PRO antibody with a test sample of tissue cells obtained 

25 from the mammal, and (b) detecting the formation of a complex between the antibody and a PRO 
polypeptide, in the test sample; wherein the formation of said complex is indicative of the presence or 
absence of said disease. The detection may be qualitative or quantitative, and may be performed in 
comparison with monitoring the complex formation in a control sample of known normal tissue cells of the 
same cell type. A larger quantity of complexes formed in the test sample indicates the presence or absence 

30 of an immune disease in the mammal from which the test tissue cells were obtained. The antibody 
preferably carries a detectable label. Complex formation can be monitored, for example, by light 
microscopy, flow cytometry, fluorimetry, or other techniques known in the art. The test sample is usually 
obtained from an individual suspected of having a deficiency or abnormality of the immune system. 

In another embodiment, the invention provides a method for determining the presence of a PRO 

35 polypeptide in a sample comprising exposing a test sample of cells suspected of containing the PRO 
polypeptide to an anti-PRO antibody and determining the binding of said antibody to said cell sample. In a 
specific aspect, the sample comprises a cell suspected of containing the PRO polypeptide and the antibody 
binds to the cell. The antibody is preferably detectably labeled and/or bound to a solid support. 

In another embodiment, the present invention concerns an immune-related disease diagnostic kit, 

40 comprising an anti-PRO antibody and a carrier in suitable packaging. The kit preferably contains 



wo 2004/041170 



PCT/US2003/034312 



instructions for using the antibody to detect the presence of the PRO polypeptide. Preferably the carrier is 
pharmaceutically acceptable. 

In another embodiment, the present invention concerns a diagnostic kit, containing an anti-PRO 
antibody in suitable packaging. The kit preferably contains instructions for using the antibody to detect the 
5 PRO polypeptide. 

In another embodiment, the invention provides a method of diagnosing an immune-related disease 
in a manamal which comprises detecting the presence or absence or a PRO polypeptide in a test sample of 
tissue cells obtained from said mammal, wherein the presence or absence of the PRO polypeptide in said test 
sample is indicative of the presence of an immune-related disease in said mammal. 
10 In another embodiment, the present invention concerns a method for identifying an agonist of a 

PRO polypeptide comprising: 

(a) contacting cells and a test compound to be screened under conditions suitable for the induction 
of a cellular response normally induced by a PRO polypeptide; and 

(b) determining the induction of said cellular response to determine if the test compound is an 
15 effective agonist, wherein the induction of said cellular response is indicative of said test compound being an 

effective agonist. 

In another embodiment, the invention concerns a method for identifying a compound capable of 
inhibiting the activity of a PRO polypeptide comprising contacting a candidate compound with a PRO 
polypeptide under conditions and for a time sufficient to allow these two components to interact and 
20 determining whether the activity of the PRO polypeptide is inhibited. In a specific aspect, either the 
candidate compound or the PRO polypeptide is immobilized on a solid support. In another aspect, the non- 
immobilized component carries a detectable label. In a preferred aspect, this method comprises the steps of: 
(a) contacting cells and a test compound to be screened in the presence of a PRO polypeptide under 
conditions suitable for die induction of a cellular response normally induced by a PRO polypeptide; and 
25 (b) determining the induction of said cellular response to determine if the test compound is an 

effective antagonist. 

In another embodiment, the invention provides a method for identifying a compound that inhibits 
the expression of a PRO polypeptide in cells that normally express the polypeptide, wherein the method 
comprises contacting the cells with a test compound and determining whether the expression of the PRO 
30 polypeptide is inhibited. In a preferred aspect, this method comprises the steps of: 

(a) contacting cells and a test compound to be screened under conditions suitable for allowing 
expression of the PRO polypeptide; and 

(b) determining the inhibition of expression of said polypeptide. 

In yet another embodiment, the present invention concerns a method for treating an iromune-related 
35 disorder in a mammal that suffers therefrom comprising administering to the mammal a nucleic acid 
molecule that codes for either (a) a PRO polypeptide, (b) an agonist of a PRO polypeptide or (c) an 
antagonist of a PRO polypeptide, wherein said agonist or antagonist may be an anti-PRO antibody. In a 
preferred embodiment, the mammal is human. In another preferred embodiment, the nucleic acid is 
administered via ex vivo gene therapy. In a further preferred embodiment, the nucleic acid is comprised 
40 within a vector, more preferably an adenoviral, adeno-associated viral, lentiviral or retroviral vector. 
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In yet another aspect, the invention provides a recombinant viral particle comprising a viral vector 
consisting essentially of a promoter, nucleic acid encoding (a) a PRO polypeptide, (b) an agonist polypeptide 
of a PRO polypeptide, or (c) an antagonist polypeptide of a PRO polypeptide, and a signal sequence for 
cellular secretion of the polypeptide, wherein the viral vector is in association with viral structural proteins. 
5 Preferably, the signal sequence is from a mammal, such as from a native PRO polypeptide. 

In a still ftirther embodiment, the invention concems an ex vivo producer cell comprising a nucleic 
acid construct that expresses retroviral structural proteins and also comprises a retroviral vector consisting 
essentially of a promoter, nucleic acid encoding (a) a PRO polypeptide, (b) an agonist polypeptide of a PRO 
polypeptide or (c) an antagonist polypeptide of a PRO polypeptide, and a signal sequence for cellular 
10 secretion of the polypeptide, wherein said producer cell packages the retroviral vector in association with the 
structural proteins to produce recombinant retroviral particles. 

In a still frirther embodiment, the invention provides a method of increasing the activity of 
monocytes/macrophages in a mammal comprising administering to said mammal (a) a PRO polypeptide, (b) 
an agonist of a PRO polypeptide, or (c) an antagonist of a PRO polypeptide, wherein the activity of 
15 monoc3^es/macrophages in the manmial is increased. 

In a still further embodiment, the invention provides a method of decreasing the activity of 
monoc3^es/macrophages in a maromal comprising administering to said mammal (a) a PRO polypeptide, (b) 
an agonist of a PRO polypeptide, or (c) an antagonist of a PRO polypeptide, wherein the activity of 
monocytes/macrophages in the mammal is decreased. 
20 In a still ftirfher embodiment, the invention provides a method of increasing tihe proliferation of 

monocytes/macrophages in a mamutnal comprising administering to said mammal (a) a PRO polypeptide, (b) 
an agonist of a PRO polypeptide, or (c) an antagonist of a PRO polypeptide, wherein the proliferation of 
monocytes/macrophages in the mammal is increased. 

In a still frirther embodiment, the invention provides a method of decreasing the proliferation of 
25 monocytes/macrophages in a mammal comprising administering to said mammal (a) a PRO polypeptide, (b) 
an agonist of a PRO polypeptide, or (c) an antagonist of a PRO polypeptide, wherein the proliferation of 
monocytes/macrophages in the mammal is decreased. 

B. Additional Embodiments 

In other embodiments of the present invention, the invention provides vectors comprising DNA 
30 encoding any of the herein described polypeptides. Host cell comprising any such vector are also provided. 
By way of example, the host cells may be CHO cells, coli, or yeast A process for producing any of the 
herein described polypeptides is frirther provided and comprises culturing host cells under conditions 
suitable for expression of the desired polypeptide and recovering the desired polypeptide from the cell 
culture. 

35 In other embodiments, the invention provides chimeric molecules comprising any of the herein 

described polypeptides fiised to a heterologous polypeptide or amino acid sequence. Example of such 
chimeric molecules comprise any of the herein described polypeptides ftised to an epitope tag sequence or a 
Fc region of an immimoglobulin. 
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In another embodiment, the invention provides an antibody which specifically binds to any of the 
above or below described polypeptides. Optionally, tlie antibody is a monoclonal antibody, humanized 
antibody, antibody fragment or single-chain antibody. 

In yet other embodiments, the invention provides oligonucleotide probes useful for isolating 
5 genomic and cDNA nucleotide sequences or as antisense probes, wherein those probes may be derived from 
any of the above or below described nucleotide sequences. 

In other embodiments, the invention provides an isolated nucleic acid molecule comprising a 
nucleotide sequence that encodes a PRO polypeptide. 

In one aspect, the isolated nucleic acid molecule comprises a nucleotide sequence having at least 

10 about 80% nucleic acid sequence identity, alternatively at least about 81% nucleic acid sequence identity, 
alternatively at least about 82% nucleic acid sequence identity, alternatively at least about 83% nucleic acid 
sequence identity, alternatively at least about 84% nucleic acid sequence identity, alternatively at least about 
85% nucleic acid sequence identity, alternatively at least about 86% nucleic acid sequence identity, 
alternatively at least about 87% nucleic acid sequence identity, alternatively at least about 88% nucleic acid 

15 sequence identity, alternatively at least about 89% nucleic acid sequence identity, alternatively at least about 
90% nucleic acid sequence identity, alternatively at least about 91% nucleic acid sequence identity, 
alternatively at least about 92% nucleic acid sequence identity, alternatively at least about 93% nucleic acid 
sequence identity, alternatively at least about 94% nucleic acid sequence identity, alternatively at least about 
95% nucleic acid sequence identity, alternatively at least about 96% nucleic acid sequence identity, 

20 alternatively at least about 97% nucleic acid sequence identity, alternatively at least about 98% nucleic acid 
sequence identity and alternatively at least about 99% nucleic acid sequence identity to (a) a DNA molecule 
encoding a PRO polypeptide having a friU-length amino acid sequence as disclosed herein, an amino acid 
sequence lacking the signal peptide as disclosed herein, an extracellular domain of a transmembrane protein, 
with or without the signal peptide, as disclosed herein or any other specifically defined fragment of the fiill- 

25 length amino acid sequence as disclosed herein, or (b) the complement of the DNA molecule of (a). 

In other aspects, tlie isolated nucleic acid molecule comprises a nucleotide sequence having at least 
about 80% nucleic acid sequence identity, alternatively at least about 81% nucleic acid sequence identity, 
alternatively at least about 82% nucleic acid sequence identity, alternatively at least about 83% nucleic acid 
sequence identity, alternatively at least about 84% nucleic acid sequence identity, alternatively at least about 

30 85% nucleic acid sequence identity, alternatively at least about 86% nucleic acid sequence identity, 

alternatively at least about 87% nucleic acid sequence identity, alternatively at least about 88% nucleic acid 
sequence identity, alternatively at least about 89% nucleic acid sequence identity, alternatively at least about 
90% nucleic acid sequence identity, alternatively at least about 91% nucleic acid sequence identity, 
alternatively at least about 92% nucleic acid sequence identity, alternatively at least about 93% nucleic acid 

35 sequence identity, alternatively at least about 94% nucleic acid sequence identity, alternatively at least about 
95% nucleic acid sequence identity, alternatively at least about 96% nucleic acid sequence identity, 
alternatively at least about 97% nucleic acid sequence identity, alternatively at least about 98% nucleic acid 
sequence identity and alternatively at least about 99% nucleic acid sequence identity to (a) a DNA molecule 
comprising the coding sequence of a full-length PRO polypeptide cDNA as disclosed herein, die coding 

40 sequence of a PRO polypeptide lacking the signal peptide as disclosed herein, the coding sequence of an 
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extracellular domain of a transmembrane PRO polypeptide, with or withiout the signal peptide, as disclosed 
herein or the coding sequence of any other specilBcally dejQned jfragment of the full-length amino acid 
sequence as disclosed herein, or (b) the complement of the DNA molecule of (a). 

In a further aspect, the invention concerns an isolated nucleic acid molecule comprising a 
5 nucleotide sequence having at least about 80% nucleic acid sequence identity, alternatively at least about 
81% nucleic acid sequence identity, alternatively at least about 82% nucleic acid sequence identity, 
alternatively at least about 83% nucleic acid sequence identity, alternatively at least about 84% nucleic acid 
sequence identity, alternatively at least about 85% nucleic acid sequence identity, alternatively at least about 
86% nucleic acid sequence identity, alternatively at least about 87% nucleic acid sequence identity, 

10 alternatively at least about 88% nucleic acid sequence identity, alternatively at least about 89% nucleic acid 
sequence identity, alternatively at least about 90% nucleic acid sequence identity, alternatively at least about 
91% nucleic acid sequence identity, alternatively at least about 92% nucleic acid sequence identity, 
alternatively at least about 93% nucleic acid sequence identity, alternatively at least about 94% nucleic acid 
sequence identity, alternatively at least about 95% nucleic acid sequence identity, alternatively at least about 

15 96% nucleic acid sequence identity, alternatively at least about 97% nucleic acid sequence identity, 

alternatively at least about 98% nucleic acid sequence identity and alternatively at least about 99% nucleic 
acid sequence identity to (a) a DNA molecule that encodes the same mature polypeptide encoded by any of 
the human protein cDNAs as disclosed herein, or (b) the complement of the DNA molecule of (a). 

Another aspect the invention provides an isolated nucleic acid molecule comprising a nucleotide 

20 sequence encoding a PRO polypeptide which is either transmembrane domain-deleted or transmembrane 

domain-inactivated, or is complementary to such encoding nucleotide sequence, wherein the transmembrane 
domain(s) of such polypeptide are disclosed hereiu. Therefore, soluble extracellular domains of the herein 
described PRO polypeptides are contemplated. 

Another embodiment is directed to fragments of a PRO polypeptide coding sequence, or the 

25 complement thereof, that may find use as, for example, hybridization probes, for encoding fragments of a 
PRO polypeptide that may optionally encode a polypeptide comprising a binding site for an anti-PRO 
antibody or as antisense oligonucleotide probes. Such nucleic acid fragments are usually at least about 20 
nucleotides in length, alternatively at least about 30 nucleotides in length, alternatively at least about 40 
nucleotides in length, alternatively at least about 50 nucleotides in length, alternatively at least about 60 

30 nucleotides in length, alternatively at least about 70 nucleotides in length, alternatively at least about 80 
nucleotides in length, alternatively at least about 90 nucleotides in length, alternatively at least about 100 
nucleotides in length, alternatively at least about 110 nucleotides in length, alternatively at least about 120 
nucleotides in length, alternatively at least about 130 nucleotides in length, alternatively at least about 140 
nucleotides in length, alternatively at least about 150 nucleotides in length, alternatively at least about 160 

35 nucleotides in length, alternatively at least about 170 nucleotides in length, alternatively at least about 180 
nucleotides in length, alternatively at least about 190 nucleotides in length, alternatively at least about 200 
nucleotides in length, alternatively at least about 250 nucleotides in length, alternatively at least about 300 
nucleotides in length, alternatively at least about 350 nucleotides in length, alternatively at least about 400 
nucleotides in length, alternatively at least about 450 nucleotides in length, alternatively at least about 500 

40 nucleotides in length, alternatively at least about 600 nucleotides in length, alternatively at least about 700 
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nucleotides in length, alternatively at least about 800 nucleotides in length, alternatively at least about 900 
nucleotides in length and alternatively at least about 1000 nucleotides in length, wherein in this context the 
term "about" means the referenced nucleotide sequence length plus or minus 10% of that referenced length. 
It is noted that novel fragments of a PRO polypeptide-encoding nucleotide sequence may be determined in a 
5 routine manner by aligning the PRO polypeptide-encoding nucleotide sequence with other known nucleotide 
sequences using any of a nvunber of well known sequence alignment programs and determining which PRO 
polypeptide-encoding nucleotide sequence fi:agment(s) are novel. All of such PRO polypeptide-encoding 
nucleotide sequences are contemplated herein. Also contemplated are the PRO polypeptide fragments 
encoded by these nucleotide molecule fragments, preferably those PRO polypeptide fragments that comprise 

10 a binding site for an anti-PRO antibody. 

In another embodiment, the invention provides isolated PRO polypeptide encoded by any of the 
isolated nucleic acid sequences herein above identified. 

In a certain aspect, the invention concerns an isolated PRO polypeptide, comprising an amino acid 
sequence having at least about 80% amino acid sequence identity, alternatively at least about 81% amino 

15 acid sequence identity, alternatively at least about 82% amino acid sequence identity, alternatively at least 
about 83% amino acid sequence identity, alternatively at least about 84% amino acid sequence identity, 
alternatively at least about 85% amino acid sequence identity, alternatively at least about 86% amino acid 
sequence identity, alternatively at least about 87% amino acid sequence identity, alternatively at least about 
88% anodno acid sequence identity, alternatively at least about 89% amino acid sequence identity, 

20 alternatively at least about 90% amino acid sequence identity, alternatively at least about 91% amino acid 
sequence identity, alternatively at least about 92% amino acid sequence identity, alternatively at least about 
93% amino acid sequence identity, altematively at least about 94% amino acid sequence identity, 
alternatively at least about 95% amino acid sequence identity, altematively at least about 96% amino acid 
sequence identity, altematively at least about 97% amino acid sequence identity, altematively at least about 

25 98% amino acid sequence identity and altematively at least about 99% amino acid sequence identity to a 
PRO polypeptide having a friU-length amino acid sequence as disclosed herein, an amino acid sequence 
lacking the signal peptide as disclosed herein, an extracellular domain of a transmembrane protein, with or 
without the signal peptide, as disclosed herein or any other specifically defined fragment of the fiill-length 
amino acid sequence as disclosed herein. 

30 In a ftirther aspect, the invention concems an isolated PRO polypeptide comprising an amino acid 

sequence having at least about 80% amino acid sequence identity, altematively at least about 81% amino 
acid sequence identity, altematively at least about 82% amino acid sequence identity, altematively at least 
about 83% amino acid sequence identity, altematively at least about 84% amino acid sequence identity, 
altematively at least about 85% amino acid sequence identity, altematively at least about 86% amino acid 

35 sequence identity, altematively at least about 87% ammo acid sequence identity, altemativel}^ at least about 
88% amino acid sequence identity, altematively at least about 89% amino acid sequence identity, 
alternatively at least about 90% amino acid sequence identity, altematively at least about 91% amino acid 
sequence identity, altematively at least about 92% amino acid sequence identity, altematively at least about 
93% amino acid sequence identity, alternatively at least about 94% amino acid sequence identity, 

40 altematively at least about 95% amino acid sequence identity, altematively at least about 96% amino acid 



wo 2004/041170 



PCT/US2003/034312 



sequence identity, alternatively at least about 97% amino acid sequence identity, alternatively at least about 
98% amino acid sequence identity and alternatively at least about 99% amino acid sequence identity to an 
amino acid sequence encoded by any of the human protein cDNAs as disclosed herein. 

In a specific aspect, the invention provides an isolated PRO polypeptide without the N-terminal 
5 signal sequence and/or the initiating methionine and is encoded by a nucleotide sequence that encodes such 
an amino acid sequence as herein before described. Processes for producing the same are also herein 
described, wherein those processes comprise culturing a host cell comprising a vector which comprises the 
appropriate encoding nucleic acid molecule under conditions suitable for expression of the PRO polypeptide 
and recovering the PRO polypeptide from the cell culture. 

10 Another aspect the invention provides an isolated PRO polypeptide which is either transmembrane 

domain-deleted or transmembrane domain-inactivated. Processes for producing the same are also herein 
described, wherein those processes comprise culturing a host cell comprisiag a vector which comprises the 
appropriate encoding nucleic acid molecule under conditions suitable for expression of the PRO polypeptide 
and recovering the PRO polypeptide from the cell culture. 

15 In yet another embodiment, the invention concerns agonists and antagonists of a native PRO 

polypeptide as defined herein. In a particular embodiment, the agonist or antagonist is an anti-PRO antibody 
or a small molecule. 

In a further embodiment, the invention concerns a method of identifying agonists or antagonists to a 
PRO polypeptide which comprise contacting the PRO polypeptide with a candidate molecule and monitoring 
20 a biological activity mediated by said PRO polypeptide. Preferably, the PRO polypeptide is a native PRO 
polypeptide. 

In a still fiirther embodiment, the invention concerns a composition of matter comprising a PRO 
polypeptide, or an agonist or antagonist of a PRO polypeptide as herein described, or an anti-PRO antibody, 
in combination with a carrier. Optionally, the carrier is a pharmaceutically acceptable carrier. 
25 Another embodiment of the present invention is directed to the use of a PRO polypeptide, or an 

agonist or antagonist thereof as herein before described, or an anti-PRO antibody, for the preparation of a 
medicament useful in the treatment of a condition which is responsive to the PRO polypeptide, an agonist or 
antagonist thereof or an anti-PRO antibody. 

30 BRIEF DESCRIPTION OF THE DRAWINGS 

In the list of figures for the present application, specific cDNA sequences which are differentially 
expressed in differentiated macrophages as compared to normal undifferentiated monocytes are 
individually identified with a specific alphanimierical designation. These cDNA sequences are 
differentially expressed in monocytes that are specifically treated as described in Example 1 below. If 

35 start and/or stop codons have been identified in a cDNA sequence shown in the attached figures, they are 
shown in bold and underlined font, and the encoded polypeptide is shown in the next consecutive figure. 

The Figures 1-2517 show the nucleic acids of the invention and their encoded PRO 
polypeptides. Also included, for convenience is a List of Figures attached hereto as Appendix A, which 
gives the figure number and the corresponding DNA or PRO number. 
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Figure 1407: PR058219 

Figure 1408: DNA328644, 349746.9, 209200^t 
Figure 1409: PRO84420 

Figure 1410: DNA326891, NP>001748.1, 209213_at 
Figure 1411: PR083212 

Figure 1412: DNA328645, NP-009006.1, 209216_at 
Figure 1413: PR084421 

Figure 1414: DNA227483, NP-003120.1, 209218-at 
Figure 1415: PR037946 

Figure 1416: DNA328646, NP_036517.1, 209230.^^t 
Figure 1417: PR084422 

Figure 1418A-C: DNA328647, AB017133, 209234^t 
Figure 1419: PR084423 

Figure 1420A-B: DNA328648, D87075, 209236_at 
Figure 1421: DNA328649, NP_1 16093.1, 209251_x^t 
Figure 1422: PR084424 

Figure 1423: DNA255255, NP.-07 1437.1, 209267-S_at 



Figure 1424: PRO50332 

Figure 1425A-B: DNA226827, NP-OOl 673.1, 

20928 Ijs-at 

Figure 1426: PRO37290 

Figure 1427: DNA328650, 200118.10, 209286 jat 
Figure 1428: PR084425 

Figure 1429: DNA274883, NP_000058.1, 209301 >at 

Figure 1430: PR062628 

Figure 1431: DNA328651, AF087853, 209305_S-at 
Figure 1432: PR082889 

Figure 1433: DNA327718, CASP4, 209310-S-at 
Figure 1434: PR083697 

Figure 1435: DNA328652, NP_077298.1, 20932 l-s_at 
Figure 1436: PR084426 

Figure 1437: DNA328653, AF063020, 209337^t 
Figure 1438: PR084427 
Figure 1439: DNA328654, UAPl, 209340^1 
Figure 1440: PR084428 

Figure 1441: DNA328655, 346677.3, 209341 ^_at 
Figure 1442: PR084429 

Figure 1443: DNA269630, NP»003281.1, 209344^t 

Figure 1444: PRO58042 

Figure 1445A-B: DNA328656, HSA303098, 

209345^.at 

Figure 1446: PRO84430 

Figure I447A~B: DNA328657,NP-.060895.1, 

209346.^>at 

Figure 1448: PR084431 

Figure 1449A-B: DNA328658, AF055376, 

209348^.at 

Figure 1450: PR084432 

Figure 1451: DNA327719, NP_003704.2, 209355-s^t 
Figure 1452: PR083698 

Figure 1453: DNA328659, ECMl, 209365^-at 
Figure 1454: PR084433 

Figure 1455: DNA225952, NP.001267.1, 209395^t 
Figure 1456: PR036415 

Figure 1457: DNA275366, BC001851, 209444^t 
Figure 1458: PRO63036 

Figure 1459: DNA328660, NP_003675.2, 209467 js^t 

Figure 1460: PR084434 

Figure 1461A-B: DNA328661,NP-006304.1, 

209475^t 

Figure 1462: PR084435 

Figure 1463: DNA328662, OSBPLIA, 209485..S-at 
Figure 1464: PR084436 

Figure 1465: DNA3 24899, NP_002938.1, 209507 
Figure 1466: PROS 1503 

Figure 1467: DNA274027, HSU38654, 209515^_at 
Figure 1468: PR061971 

Figure 1469: DNA328663,NP-057157.1, 209524^t 

Figure 1470: PR036183 

Figure 1471A-C: DNA328664,NP-009131.1, 

209534jc_at 

Figure 1472: PR084437 

Figure 1473A-B: DNA328665, RGL, 209568^-at 
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Figure 1474: PR084438 

Figure 1475: DNA328666, AF084943, 209585jS-at 
Figure 1476: PRO 19 17 

Figure 1477: DNA328667, S69189, 209600_s_at 
Figure 1478: PR084439 

Figure 1479: DNA328668, NP-003 157.1, 209607 jc^t 
Figure 1480: PRO84440 

Figure 1481: DNA3 28 669, NP»005 8 82.1, 209608 js_at 

Figure 1482: PR084441 

Figure 1483A-B: DNA328670, BC001618, 

209610^^t 

Figure 1484: PRO70011 

Figure 1485: DNA25 6209, NP^002259.1, 209653 jat 
Figure 1486: PR05 1256 

Figure 1487 A-B: DNA272671, HSU26710, 209682_at 
Figure 1488: PRO60796 

Figure 1489: DNA151564, DNA151564, 209683^t 
Figure 1490: PRO 11886 

Figure 1491: DNA327727, NP-.000308.1, 209694^t 
Figure 1492: PRO83705 

Figure 1493: DNA328671, NP-000498.2, 209696^t 
Figure 1494: PR084442 

Figure 1495: DNA327728, BC004492, 209703 jc^at 
Figure 1496: PR04348 

Figure 1497: DNA328672, CAA68871.1, 209707^t 
Figure 1498: PR084444 
Figure 1499A-B: DNA328673, HUMCSDFl, 
2097 16^t 

Figure 1500: PR084445 

Figure 1501A-B: DNA304800, BC002538, 209723^t 

Figure 1502: PR069458 

Figure 1503A-B: DNA328674, NP-05601 1.1, 

209760^t 

Figure 1504: PR084446 

Figure 1505: DNA324250, NP>536349.1, 209761^-at 
Figure 1506: PRO80934 

Figure 1507 A-B: DNA328675, ADAM19, 209765-jat 
Figure 1508: PR084447 

Figure 1509: DNA327731, NP-003302.1, 209803^-at 
Figure 1510: PRO83707 
Figure 1511: DNA328676, IL16, 209827^„at 
Figure 1512: PR084448 

Figure 1513A-B: DNA196499, AB002384, 209829 _at 
Figure 1514: PR024988 

Figure 1515: DNA3 28677, AF0605 11, 209836j>c-at 
Figure 1516: PR084449 

Figure 1517: DNA324805, NP_008978.1, 209846^^t 
Figure 1518: PR081419 

Figure 1519: DNA273915, NP^036215.1, 209864_at 
Figure 1520: PR061867 

Figure 1521: DNA290585, NP.000573.1, 209875^_at 
Figure 1522: PRO70536 

Figure 1523: DNA328678, NP-008843,1, 209882jat 
Figure 1524: PR062586 

Figure 1525: DNA328679, 347423,1, 209892jat 
Figure 1526: PRO84450 



Figure 1527: DNA328258, HSM802616, 209900^^t 

Figure 1528: PR084151 

Figure 1529A-B: DNA328680, NP-062541.1, 

209907_s_at 

Figure 1530: PR084451 

Figure 1531: DNA299884, AB040875, 209921-at 
Figure 1532: PRO70858 

Figure 1533: DNA328681, NP^005089.1, 209928 js-at 
Figure 1534: PR084452 

Figure 1535: DNA272326, NP.006 154,1, 209930^^t 
Figure 1536: PRO60583 

Figure 1537: DNA328682, AF225981, 209935^t 
Figure 1538: PR084453 

Figure 1539: DNA327754, NP^l 50634.1, 209970 jt-at 
Figure 1540: PR04526 

Figure 1541: DNA328683, NP_000399.1, 210007-S^t 
Figure 1542: PR084454 

Figure 1543: DNA227660, NP-001327.1, 210042^-at 
Figure 1544: PR038123 

Figure 1545: DNA327739, AF092535, 210058^t 
Figure 1546: PR083714 

Figure 1547: DNA327740, NP_003944.1, 210087-S^t 
Figure 1548: PR01787 

Figure 1549: DNA328684, BC001234, 210102>at 

Figure 1550: PR084455 

Figure 1551A-B: DNA3 28685, NP.l 27497,1, 

210113-s_at 

Figure 1552: PR034751 

Figure 1553: DNA3 28686, NP-000566.1, 2101 18^^t 
Figure 1554: PR064 

Figure 1555: DNA227757,NP_000743.1, 210128-S-at 

Figure 1556: PRO38220 

Figure 1557: DNA227501, NP-000295.1, 210139^_at 
Figure 1558: PR037964 

Figure 1559: DNA328687, AF004231, 210146-x:-at 

Figure 1560: PR084456 

Figure 1561A-B: DNA328688, NP_006838.2, 

210152_at 

Figure 1562: PR084457 

Figure 1563: DNA328689, NP-003259.2, 210166^t 
Figure 1564: PR07521 

Figure 1565: DNA270196, HUMZFMIB, 210172_at 
Figure 1566: PR058584 

Figure 1567: DNA328690, NP_524145.1, 210240^_at 
Figure 1568: PRO59660 

Figure 1569: DNA326963, HRIHFB2122, 210276_S-at 
Figure 1570: PR083276 

Figure 1571: DNA328691, NP_065717.1, 210346-S-at 
Figure 1572: PR084458 

Figure 1573: DNA227652, NP-002549.1, 210401-at 
Figure 1574: PR038 115 

Figure 1575: DNA225514, NP^003864.1, 210510-S-at 
Figure 1576: PR035977 

Figure 1577: DNA216517, NP_005055.1, 210549^-at 
Figure 1578: PR034269 

Figure 1579: DNA327746, HUMGCBA, 210589^^t 
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Figure 1580: PRO83720 

Figure 1581: DNA328692, AF025529, 210660^t 
Figure 1582: PR084459 

Figure 1583: DNA272127, NP.003928.1, 210663^_at 
Figure 1584: PRO60397 

Figure 1585: DNA326525, NP_006330.1, 210719^.at 
Figure 1586: PR082894 

Figure 1587: DNA226183, NP-001453.1, 210773^-at 
Figure 1588: PR036646 

Figure 1589: DNA226078, NP.000296.1, 210830-S^t 
Figure 1590: PR036541 

Figure 1591: DNA226152, NP^002650.1, 210845^^t 

Figure 1592: PR03 66 15 

Figure 1593: DNA328693, HSU03891, 210873jc^t 
Figure 1594: PRO84460 

Figure 1595: DNA328694, BC007810, 210944 js_at 
Figure 1596: PR084461 

Figure 1597: DNA213676, NP-004604.1, 211003jc^t 
Figure 1598: PR035 142 

Figure 1599: DNA328695, NP-002145.1, 211015^_at 
Figure 1600: PRO61480 

Figure 1601: DNA328696, NP_009214.1, 211026-S.at 
Figure 1602: PRO62720 

Figure 1603: DNA328697, NP-1 16112.1, 21 1038 _S-at 
Figure 1604: PR084462 

Figure 1605: DNA328698, BC006403, 211063^_at 
Figure 1606: PR012168 

Figure 1607: DNA326712, NP_001285,1, 21 1 136-5_at 

Figure 1608: PRO83054 

Figure 1609 A-B: DNA328699, AF189723, 

211137^_at 

Figure 1610: PR084463 

Figure 1611: DNA327752, HSDHACTYL, 

211150_s_at 

Figure 1612A--B: DNA328700, SCD, 211162jc-at 
Figure 1613: PR084464 

Figure 1614: DNA328701, PSEN2, 21 1373-S_at 
Figure 1615: PRO80745 

Figure 1616: DNA328702, NP„036519.U 211413^^t 
Figure 1617: PR084465 

Figure 1618: DNA256637, NP.008849.1, 211423js-at 
Figure 1619: PR051621 

Figure 1620: DNA328703, NP_003956.1, 211434^-at 
Figure 1621: PR01873 

Figure 1622: DNA327755, NP»1 15957.1, 211458^^t 
Figure 1623: PR083725 

Figure 1624A-B: DNA328704, FGFRl, 211535^-at 
Figure 1625: PR034231 
Figure 1626: DNA324626, RIL, 211564^^t 
Figure 1627: PR081272 

Figure 1628: DNA328705, NP_001345.1, 211653jc^t 
Figure 1629: PR062617 

Figure 1630: DNA328706, BC021909, 211714j{:^t 

Figure 1631: PRO 10347 

Figure 1632 A-B: DNA328707, AF172264, 

211828^.at 



Figure 1633: PR084466 

Figure 1634: DNA226582, NP>003863.1, 211844^^t 
Figure 1635: PRO37045 

Figure 1636: DNA151912,BAA06683,1, 211935^t 
Figure 1637: PRO 12756 

Figure 1638: DNA325941, NP-005339.1, 211968-s^at 
Figure 1639: PR082388 

Figure 1640: DNA287433, NP-006810.1, 212009js_at 
Figure 1641: PRO69690 

Figure 1642: DNA3 28708, NP„002678.1, 2 1203 6-S-at 
Figure 1643: PR084467 

Figure 1644: DNA103380, NP^003365.1, 212038-S-at 
Figure 1645: PRO4710 

Figure 1646: DNA328709, BC004151, 212048^^t 
Figure 1647: PR037676 

Figure 1648A-B: DNA254751, AB018353, 212074^t 
Figure 1649: DNA328710, HUMLAMA, 212086^-at 
Figure 1650A-B: DNA298616, NP.001839.1, 
212091-s_at 

Figure 1651: PRO71027 

Figure 1652: DNA154139, DNA154139, 212099^1 
Figure 1653: DNA328711, AK023154, 212115-at 
Figure 1654: PR084468 

Figure 1655: DNA328712, NP_006501.1, 212118-at 
Figure 1656: PR084469 

Figure 1657: DNA328713, AF100737, 212130-x^t 
Figure 1658: PRO84470 

Figure 1659: DNA328714, HSM801966, 212146jat 
Figure 1660A-B: DNA151915, BAA09764.1, 

212149_at 

Figure 1661: PR012758 

Figure 1662: DNA88630, AAA52701.1, 212154_at 
Figure 1663: PR02877 

Figure 1664: DNA328715, BC000950, 212160^1 
Figure 1665: DNA328716, HSM800707, 212179^t 
Figure 1666A~C: DNA255018, CAB61363.1, 

212207-at 

Figure 1667: PRO50107 

Figure 1668A-B: DNA328717, CAB70761.1, 

212232^t 

Figure 1669: PR084473 

Figure 1670: DNA196116, DNA1961 16, 212246^t 
Figure 1671A-B: DNA254262,NP_055 197.1, 
212255-s_at 

Figure 1672: PR049373 

Figure 1673: DNA3 2777 1, NP_ 10959 1.1, 212268^1 

Figure 1674: PR083737 

Figure 1675A-B: DNA328718, AAC39776.1, 

212285_s„at 

Figure 1676: PR084474 

Figure 1677: DNA328719, BC012895, 212295^-at 
Figure 1678: PR084475 

Figure 1679: DNA271103,NP_005796.1, 212296^t 

Figure 1680: PR059425 

Figure 1681A-B: DNA328720, HSA306929, 

212297^t 



26 



wo 2004/041170 



PCT/US2003/034312 



Figure 1682: PR084476 

Figure 1683A-^B: DNA328721, 1450005.12, 21 229 8 -at 

Figure 1684: PR084477 

Figure 1685A-B: DNA150464, BAA34466.1, 

212311^1 

Figure 1686: PRO 12270 

Figure 1687: DNA326808, BC019307, 212312jat 

Figure 1688: PR083141 

Figure 1689A-B: DNA124122, NP^005602.2, 

212332^t 

Figure 1690: PR06323 

Figure 1691: DNA287190, CAB43217,1, 212333^t 
Figure 1692: PR069476 

Figure 1693A-B: DNA255527, HUMTI227HC, 
212337^t 

Figure 1694: DNA328722, BC012469, 212341^t 
Figure 1695: PR084478 

Figure 1696: DNA328723, S47833, 212360^t 
Figure 1697: PR036682 

Figure 1698A-B: DNA328724, AB007856, 212367-at 
Figure 1699A-B: DNA327773, BAA25456.1, 

212368-at 

Figure 1700: PR083739 

Figure 1701A-C: DNA328725, AB007923, 212390-at 
Figure 1702A-B: DNA150950, BAA07645.1, 

212396_s_-at 

Figure 1703: PR012554 

Figure 1704A-B: DNA328726, BAA25466.2, 

212443^t 

Figure 1705: PRO84480 

Figure 1706: DNA328727, AB033105, 212453 
Figure 1707 A-B: DNA328728, 481567.2, 212458jat 
Figure 1708: PR084482 

Figure 1709: DNA15 1348, DNA15 1348, 212463^t 
Figure 1710: PR011726 

Figure 171 1A-: DNA328729, D80001, 212486^-at 

Figure 1712: PR038526 

Figure 1713A-B: DNA32S730, BAA74899.2, 

212492_s_at 

Figure 1714: PR084483 

Figure 1715A-B: DNA328731, 234169.5, 212500-at 
Figure 1716: PR084484 

Figure 1717: DNA328732, NP^l 16193.1, 212502^t 
Figure 1718: PR084485 
Figure 1719: DNAO, AF038183, 212527 jat 
Figure 1720: PRO 

Figure 1721: DNA328734, AAH01171.1, 212539jat 
Figure 1722: PR084487 
Figure 1723: DNA328735, PHIP, 212542_s-at 
Figure 1724: PR084488 

Figure 1725: DNA328736, BC009846, 212552^1 
Figure 1726: PR084489 

Figure 1727 A-D: DNA328737, 148650,1, 212560^t 
Figure 1728: PRO84490 

Figure 1729: DNA270260, HSPDCE2, 212568^-at 
Figure 1730A-B: DNA328738, BAA31625.1, 



212569^t 

Figure 1731: PR084491 

Figure 1732A-B: DNA328739, PTPRC, 212587-S-at 
Figure 1733: PR084492 

Figure 1734: DNA327776, 1379302.1, 21 2593^-at 
Figure 1735: PR083742 

Figure 1736: DNA15 1487, DNA 15 1487, 2 12594 jat 

Figure 1737: PR011833 

Figure 1738A-B: DNA328740, BAA76781.1, 

212611^1 

Figure 1739: PR084493 

Figure 1740: DNA81753, DNA81753, 212613jat 

Figure 1741: PR09216 

Figure 1742A-B: DNA253817, BAA20767.1, 

212615-at 

Figure 1743: PRO49220 

Figure 1744A-B: DNA328741, 474863.12, 212622^t 
Figure 1745: PR084494 

Figure 1746: DNA194679, BAA05062.1, 212623^t 
Figure 1747: PR023989 

Figure 1748A-B: DNA328742, 244522.6, 212628^t 
Figure 1749: PRO59047 

Figure 1750: DNA270683, NP-006247.1, 212629-S_at 

Figure 1751: PRO59047 

Figure 1752A-D: DNA327777, HSILIRECA, 

212657^_at 

Figure 1753A-B: DNA150762, BAA13197.1, 
212658^t 

Figure 1754: PR012455 

Figure 1755: DNA327838, NP_000568.1, 212659^^t 
Figure 1756: PR083789 

Figure 1757: DNA328743, 1234685.2, 212667 jat 
Figure 1758: PR084495 

Figure 1759: DNA3 28744, AF3 18364, 2 12680 jc_at 
Figure 1760: PR084496 

Figure 1761: DNA328745, 482138,6, 21 2687-at 
Figure 1762: PR084497 

Figure 1763: DNA324378, NP_000523.1, 212694_s„at 
Figure 1764: PROS 1047 

Figure 1765: DNA328746, CAB43213.1, 212698^_at 

Figure 1766: PR084498 

Figure 1767A-B: DNA328747, BAA83030.1, 

212765-at 

Figure 1768: PR084499 

Figure 1769A-B: DNA328748, HSJ001388, 212774_at 
Figure 1770: PRO59570 

Figure 1771: DNA328749, HSM802266, 212779-at 

Figure 1772: DNA328750, 7689361.1, 212812^t 

Figure 1773: PRO84500 

Figure 1774A-B:DNA328751,AF012086, 

212842_x>at 

Figure 1775: DNA328752, CAA76270,1, 212864«at 

Figure 1776: PRO84501 

Figure 1777A-B: DNA328753, BAA13212.1, 

212873^t 

Figure 1778: PRO84502 
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Figure 1779: DNA271630, DNA271630, 212907-at 
Figure 1780: DNA328754, 1397726.9, 2 1291 2 jat 
Figure 1781: PRO84503 
Figure 1782A-B: DNA328755, BAA25490.1, 

212946^t 

Figure 1783: PRO84504 

Figure 1784A-B: DNA328756, BAA74893.2, 

212975^t 

Figure 1785: PRO84505 

Figure 1786: DNA154982, DNA154982, 213034^t 
Figure 1787: DNA3 27785, BCO 17336, 2 13061 ^.at 
Figure 1788: PR083749 

Figure 1789A-C: DNA328757, 475076.9, 213069 jat 
Figure 1790: PRO84506 

Figure 1791 A-B: DNA328758, AB011123, 213109-at 
Figure 1792: DNA272600, NP-057259.1, 213112^_at 
Figure 1793: PRO60737 

Figure 1794: DNA326217, NP^004474.1, 213129^^t 
Figure 1795: PRO82630 

Figure 1796: DNA228053, DNA228053, 213158^t 
Figure 1797 A-G: DNA103535, AF027153, 213164^t 
Figure 1798: PR04862 

Figure 1799: DNA150875, CAB45717.1, 213246-at 
Figure 1800: PR011589 

Figure 1801: DNA328759, HUMLPACI09, 213258^t 
Figure 1802: DNA328760, 1376674.1, 21 3274-S_at 
Figure 1803: PRO84508 
Figure 1804A-B: DNA328761, BAA82991,1, 

213280-at 

Figure 1805: PRO84509 

Figure 1806: DNA260974, NP.006065.1, 213293»s_at 
Figure 1807: PRO54720 

Figure 1808: DNA328762, AAL30845.1, 213338-at 

Figure 1809: PRO84510 

Figure 1810: DNA327789, 1449824.5, 21 3348^t 
Figure 1811: PR083753 

Figure 1812: DNA328763, NP_001219.2, 213373^_at 

Figure 1813: PR084511 

Figure 1814: DNA328764, NP-438169.1, 213375^^t 
Figure 1815: PR084512 

Figure 1816: DNA328765, 411350,1, 213391 jat 
Figure 1817: PR084513 

Figure 1818: DNA106195, DNA106195, 213454^1 
Figure 1819: DNA327795, BC014226, 213457^t 
Figure 1820: DNA3 287 66, NP-006077.1, 2 13476 jat 
Figure 1821: PR084514 

Figure 1822: DNA328767, BC008767, 213501-at 
Figure 1823: PR084515 

Figure 1824: DNA254264, HSM800224, 213546-at 
Figure 1825: PR049375 

Figure 1826: DNA328768, 1194561.1, 2 13572^_at 
Figure 1827: PR084516 

Figure 1828: DNA327800, 1251176.10, 2 13593 ^.at 
Figure 1829: PR083763 

Figure 1830: DNA151422, DNA15 1422, 2 13605 js»at 
Figure 1831: PRO 11 792 



Figure 1832: DNA225974, NP_000864.1, 213620_s_at 
Figure 1833: PR036437 

Figure 1834: DNA328769, CAA69330.1, 213624^t 
Figure 1835: PR084517 

Figure 1836: DNA260173, DNA260173, 213638-at 

Figure 1837: PRO54102 

Figure 1838A-C: DNA273792, DNA273792, 

213649^t 

Figure 1839: DNA151886, CAB43234,1, 213682_at 
Figure 1840: PRO 12745 

Figure 1841: DNA227788, NP^002995.1, 213716js-at 
Figure 1842: PR038251 

Figure 1843: DNA328771, HSMYOSIE, 213733_at 
Figure 1844: DNA328772, AAC 19 149.1, 21 3761 
Figure 1845: PR084519 

Figure 1846: DNA328773, BC001528, 213766jc»at 
Figure 1847: PRO84520 

Figure 1848: DNA328774, NP-004263.1, 213793^_at 

Figure 1849: PRO60536 

Figure 1850A-B: DNA328775, NP-006540.2, 

213812-5_at 

Figure 1851: PR084521 

Figure 1852: DNA328776, 407661.4, 2138 17-at 
Figure 1853: PR084522 

Figure 1854A-B: DNA328777, IDN3, 213918-s_at 
Figure 1855: PR084523 

Figure 1856: DNA196110, DNA196110, 214016^^1 
Figure 1857: PR024635 

Figure 1858: DNAl 50990, NP_003632.1, 21 4022^^t 
Figure 1859: PRO 12570 

Figure 1860: DNA328778, 234498.37, 2 14093 js_at 

Figure 1861: PR084524 

Figure 1862A-B: DNA272292, NP_055459.1, 

214130-s_at 

Figure 1863: PRO60550 

Figure 1864: DNA82378, NP^002695.1, 214146-S^t 

Figure 1865: PR01725 

Figure 1866A-B: DNA328779, 332730.12, 

214155^_at 

Figure 1867: PR084525 

Figure 1868: DNA304659, NP-002023.1, 21421 l^t 
Figure 1869: PRO71086 

Figure 1870: DNA256662, NP^009112.1, 214219jc_at 
Figure 1871: PR051628 

Figure 1872A-B: DNA328780, 480940.15, 214285^t 
Figure 1873: PR084526 

Figure 1874: DNA32S781, 1453703.13, 214349 -at 
Figure 1875: PR084527 

Figure 1876: DNA273174, NP_00195L1, 214394jc_at 

Figure 1877: PR061211 

Figure 1878: DNA328782, 337794.1, 2 14405 ^t 
Figure 1879: PR084528 

Figure 1880: DNA287630, NP^OOO 160.1, 21 4430^t 
Figure 1881: PR02154 

Figure 1882: DNA227376, NP_005393.1, 214435jic_at 
Figure 1883: PR037839 
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Figure 1884: DNA273138, NP_005495.1, 214452-^t 
Figure 1885: PR061182 

Figure 1886: DNA327812, NP.006408.2, 214453^-at 
Figure 1887: PR083773 

Figure 1888: DNA302598, NP-066361.1, 214487^^t 
Figure 1889: PR062511 

Figure 1890: DNA3 28783, NP.00202 1.2, 2 14560 jat 

Figure 1891: PR084529 

Figure 1892: DNA324728, BC0I7730, 214581 jc_at 
Figure 1893: PR0868 

Figure 1894A-B: DNA328784, 331045.1, 214582^t 
Figure 1895: PRO84530 

Figure 1896: DNA328785, NP_004062.1, 214683-S>at 
Figure 1897: PR084531 

Figure 1898: DNA328786, BC017407, 214686^t 

Figure 1899: PR084532 

Figure 1900: DNA271990, DNA271990, 214722^t 
Figure 1901A-B: DNA274485, AB007863, 214735^1 
Figure 1902: DNA328787. 238292.8, 214746-s^t 
Figure 1903: PR084533 

Figure 1904: DNA328788, AK023937, 214763 jat 
Figure 1905: PR029183 

Figure 1906A-B: DNA328789, 344240.3, 214770_at 
Figure 1907: PR084534 

Figure 1908A-B: DNA328790, 481415.9, 214786^t 
Figure 1909: PR084535 

Figure 1910: DNA328791, 1383762.1, 21 4790^t 
Figure 1911: PR084536 

Figure 1912: DNA328792, 7692351.10, 214830^1 
Figure 1913: PROS4537 

Figure 1914: DNA328314, BC022780, 214841^t 
Figure 1915: PR084182 

Figure 1916: DNA83102, DNA83102, 214866 jat 
Figure 1917: PR02591 

Figure 1918: DNA161326, DNA161326, 214934^1 
Figure 1919: DNA328794, 1099353.2, 2 14974 jcjit 
Figure 1920: PR084539 

Figure 1921: DNA328795, AF057354, 214975^_at 
Figure 1922: DNA328796, HSM800535, 215078^t 
Figure 1923: DNA328797, 000092.6, 215087-at 
Figure 1924: PRO84540 

Figure 1925: DNA328798, NP..002088.1, 215091 -S-at 
Figure 1926: PR084541 

Figure 1927: DNA328799, BC008376, 215101^-at 
Figure 1928: PR01721 

Figure 1929: DNA270522, NP.006013,1, 2151 1 l-s^at 
Figure 1930: PR058899 

Figure 1931: DNA328800, 194537.1, 215224^t 

Figure 1932: PR084542 

Figure 1933A-B: DNA327827, HSM800826, 

215235^1 

Figure 1934A-B: DNA226905, NP>.055672.1, 

215342_s.at 

Figure 1935: PR037368 

Figure 1936: DNA327831, NP.076956,1, 215380^_at 
Figure 1937: PR083783 



Figure 1938: DNA328801, 407831.1, 215392_at 
Figure 1939: PR084543 

Figure 1940A-B: DNA328802, C6orf5, 21541 l^.at 
Figure 1941: PR084544 

Figure 1942: DNA275385, NP.002085.1, 215438-jc_at 
Figure 1943: PRO63048 

Figure 1944: DNA328803, BAA9 1443.1, 215440js-at 
Figure 1945: PR084545 

Figure 1946: DNA328804, 403621.1, 21 5767^t 

Figure 1947: PR084546 

Figure 1948A-B: DNA328805, BAA86482.1, 

215785^_at 

Figure 1949: PR084547 

Figure 1950: DNA328806, 208045.1, 21 6109-^t 
Figure 1951: PR084548 

Figure 1952: DNA269532,NP»004802.1, 216250js^t 
Figure 1953: PR057948 

Figure 1954: DNA328807, AAH10129.1, 216483-S_at 
Figure 1955: PR084549 

Figure 1956: DNA188349, NP-002973.1, 216598^_at 

Figure 1957: PR021884 

Figure 1958: DNA328808, 1099517.2, 216607^-at 
Figure 1959: PRO84550 

Figure 1960: DNA328809, PTPN12, 216915-5^t 
Figure 1961: PRO4803 

Figure 1962: DNA328810, NP.OO 1770.1, 21 6942^_at 
Figure 1963: PR02557 

Figure 1964A-C: DNA328811, NP,002213.1, 

216944_s_at 

Figure 1965: PR084551 

Figure 1966: DNA328812, BAA86575.1, 216997 jc^t 

Figure 1967: PR084552 

Figure 1968A-B: DNA328813, BAA76774.1, 

217118^_at 

Figure 1969: PR084553 

Figure 1970A-B: DNA328814, HUMMHHLAJC, 

217436jx:_at 

Figure 1971A~B: DNA328815, 331104.2, 217521_at 
Figure 1972: PR084554 

Figure 1973: DNA328816, 1446567.1, 217526^t 

Figure 1974: PR084555 

Figure 1975A-B: DNA255619, AF054589, 

217599_s.at 

Figure 1976: PRO50682 

Figure 1977: DNA327848, NP-005998.1, 217649-at 

Figure 1978: PR083793 

Figure 1979: DNA328817, 1498470.1, 217678-at 
Figure 1980: PR084556 

Figure 1981: DNA328818, NP^07 1435.1, 21 7730_^t 
Figure 1982: PR038175 

Figure 1983: DNA327935, NP-079422.1, 217745_S-at 
Figure 1984: PR083866 

Figure 1985A-B: DNA88040, NP.-000005.1, 217757^t 
Figure 1986: PR02632 

Figure 1987A-B: DNA88226, NP-000055.1, 211161 jaX 
Figure 1988: PR02237 
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Figure 1989: DNA325821, NP-057016.1, 217769 _s^t 
Figure 1990: PR082287 

Figure 1991: DNA227358, NP_057479.1, 217777_S-at 
Figure 1992: PR037821 

Figure 1993: DNA328819, NP-,057145.1. 217783-S_at 
Figure 1994: PR084557 

Figure 1995: DNA327850, NP_006546.1, 217785-S_at 
Figure 1996: PRO60803 

Figure 1997: DNA328303, NP.056525.1, 217807^_at 
Figure 1998: PR084173 

Figure 1999: DNA328820, NP^077022.1, 217808-S^t 
Figure 2000: PR084558 

Figure 2001: DNA328821, NP.006708.1, 217813-S-at 
Figure 2002: PR084559 

Figure 2003: DNA328822, AK001511, 217830js^t 

Figure 2004: PRO84560 

Figure 2005: DNA328823, NP_057421.1, 217838^.at 
Figure 2006: PR084561 

Figure 2007: DNA226759, NP^054775.1, 217845 je^at 
Figure 2008: PR037222 

Figure 2009: DNA327939, NP-.060654.1, 217852^^t 

Figure 2010: PR083869 

Figure 2011A-B: DNA324921, NP-073585.6, 

217853^t 

Figure 2012: PR08 1523 

Figure 2013: DNA328824, DREVl, 217868-S_at 
Figure 2014: PR084562 

Figure 2015: DNA225604, NP-057226.1, 217869^t 
Figure 2016: PRO36067 

Figure 2017: DNA326937, NP_002406.1, 217871-S-at 
Figure 2018: PR083255 

Figure 2019: DNA255145, NP^060917.1, 217882-at 
Figure 2020: PRO50225 

Figure 2021A-B: DNA328825, 1398762.1 1, 217886jat 
Figure 2022: PR084563 

Figure 2023: DNA189504, NP.064539.1, 217898jat 

Figure 2024: PRO25402 

Figure 2025: DNA328826, NP^004272.2, 217911 js_at 
Figure 2026: PR084564 

Figure 2027: DNA328827. NP-076869.1, 217949^^t 
Figure 2028: PR021784 

Figure 2029: DNA328828, NP^067027.1, 217956^_at 
Figure 2030: PR084565 

Figure 2031: DNA328829, NP-057230.1, 217959^.at 
Figure 2032: PR084566 

Figure 2033: DNA328830, NP-061 11 8.1, 217962_at 
Figure 2034: PR084567 

Figure 2035: DNA327855, NP.057387.1, 217975^t 
Figure 2036: PR083367 

Figure 2037: DNA328831, NP-057329.1, 217989_at 
Figure 2038: PR0233 

Figure 2039: DNA328832, NP.067022.1, 217995^t 

Figure 2040: PR084568 

Figure 2041: DNA328833, BC018929, 217996^1 
Figure 2042: PR084569 

Figure 2043: DNA328834, AF220656, 217997^t 



Figure 2044: DNA326005, NP_057004.1, 218007^_at 
Figure 2045: PR082446 

Figure 2046: DNA328835, NP-068760.1, 218019-S-at 
Figure 2047: PR084571 

Figure 2048: DNA328836, NP-.054894.1, 218027^t 
Figure 2049: PR084572 

Figure 2050: DNA328837,NP„057 149.1, 218046-S-at 

Figure 2051: PR081876 

Figure 2052: DNA328838, NP_054797.2, 218049_s^t 
Figure 2053: PRO70319 

Figure 2054: DNA328839, NP-057 180.1, 218059^1 
Figure 2055: PR084573 

Figure 2056: DNA328840, NP-060481.1, 218067-S-at 
Figure 2057: PR084574 

Figure 2058: DNA328841, NP-060557.2, 218073^-at 

Figure 2059: PR084575 

Figure 2060A-C: DNA328842, 235943.8, 218098^1 
Figure 2061: PR084576 

Figure 2062: DNA328843, NP^060939.1, 218099-at 
Figure 2063: PR084577 

Figure 2064: DNA328844, NP»061156.1, 2181Il^^t 
Figure 2065: PR082111 

Figure 2066: DNA227498, NP-002125.3, 218120_S-at 
Figure 2067: PR037961 

Figure 2068: DNA328845, NP.060615.1, 218126^t 
Figure 2069: PRO 10274 

Figure 2070: DNA227264, LOC51312, 218136^.at 
Figure 2071: PR037727 

Figure 2072: DNA327857, NP.057386.1, 218142^_at 
Figure 2073: PR083799 

Figure 2074: DNA325852, NP-.078813.1, 218153jat 
Figure 2075: PR0823 14 

Figure 2076: DNA328846, NP-060522.2, 218169 
Figure 2077: PR084578 

Figure 2078: DNA228094, NP-079416.1, 218175^t 
Figure 2079: PR038557 

Figure 2080: DNA328847, NP»056338.1, 218194-at 
Figure 2081: PR084579 

Figure 2082: DNA150593, NP^054747.1, 218196^t 
Figure 2083: PR012353 

Figure 2084: DNA256555, NP^060042.1, 218205^^t 
Figure 2085: PR051586 

Figure 2086: DNA328848, NP_004522.1, 218212^^t 
Figure 2087: PRO84580 

Figure 2088: DNA271622, NP.006020.3, 218224 jat 
Figure 2089: PRO59909 

Figure 2090: DNA324353, NP.004538.2, 218226-S-at 
Figure 2091: PRO81026 

Figure 2092: DNA328849, NP_057075.1, 218232_at 
Figure 2093: PR04382 

Figure 2094: DNA328850, NP.057187.1, 218254^_at 
Figure 2095: PR084581 

Figure 2096: DNA273230, NP.060914,1, 218273-S-at 
Figure 2097: PR061257 

Figure 2098: DNA328851, ISiP-068590.1, 218276^_at 
Figure 2099: PR084582 
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Figure 2100: DNA323953, NP-003507.1, 218280^^t 
Figure 2101: PRO80685 

Figure 2102: DNA254824, AF267865, 218294_s_at 

Figure 2103: PRO49920 

Figure 2104A-B: DNA328852, NP„003609.2, 

218311^1 

Figure 2105: PR084583 

Figure 2106A-B: DNA328853. NP-065702.2, 

218319^t 

Figure 2107: PR084584 

Figure 2108: DNA328854, NP.056979.1, 218350^^t 
Figure 2109: PR084585 

Figure 2110: DNA328855, NP»076952.1, 218375^t 
Figure 2111: PR09771 

Figure 21 12: DNA328856, NP_068376.1, 218380^t 
Figure 21 13: PR084586 

Figure 21 14: DNA328857, NP„037481.1, 218407jc_at 
Figure 21 15: PR084587 

Figure 21 16: DNA324953, NP„057412.1, 218412^^t 

Figure 21 17: PROS 1550 

Figure 21 18A-B: DNA255062, NP_060704.1. 

218424^.at 

Figure 21 19: PRO50149 

Figure 2120: DNA150661, NP-057162.1, 218446-S-at 
Figure 2121: PR012398 

Figure 2122: DNA326218, NP_064573.1, 218447-at 
Figure 2123: PR082631 
Figure 2124: DNA328858, HEBPl, 218450-at 
Figure 2125: PR084588 

Figure 2126: DNA327942, NP_060596.1, 218465-at 
Figure 2127: PRO83870 

Figure 2128: DNA328859, AF154054, 218468^_at 

Figure 2129: PRO1608 

Figure 2130A-B: DNA328860, NP-037504,1, 

218469-at 

Figure 2131: PRO1608 

Figure 2132: DNA328861, NP-057030.2, 218472-S,at 
Figure 2133: PR084589 

Figure 2134: DNA328862, NP-057626.2, 218499-at 
Figure 2135: PRO84590 

Figure 2136: DNA328863, NP-060264.1, 218503 jat 
Figure 2137: PR084591 

Figure 2138: DNA328864, NP-060726.2, 218512^t 
Figure 2139: PR084592 

Figure 2140: DNA255432, NP„060283.1, 218516-S^t 
Figure 2141: PRO50499 

Figure 2142: DNA194326, NP^065713,1, 218538js^t 
Figure 2143: PRO23708 

Figure 2144: DNA328865, NP^064587.1, 218557-at 
Figure 2145: PR084593 

Figure 2146: DNA328866, NP.005691.1, 218567 JC-at 
Figure 2147: PR069644 

Figure 2148: DNA328867, NP-085053.1, 218600^t 
Figure 2149: PR084594 

Figure 2150: DNA328868, NP-057629.1, 218611^t 
Figure 2151: PR084595 



Figure 2 1 52: DNA328869, NP_060892. 1 , 2 1 86 1 3 
Figure 2153: PR084596 

Figure 2154: DNA328870, NP-060639,U 218614jat 
Figure 2155: PR084597 

Figure 2156: DNA256870, NP.073600.1, 218618^^1 
Figure 2157: PRO51800 

Figure 2158: DNA254898, NP.060840.1, 218627 
Figure 2159: PR049988 

Figure 2160: DNA3 28 87 1, NP_068378.1, 218631^1 
Figure 2161: PR084598 

Figure 2162: DNA328872, NP-036528,1, 218634^t 
Figure 2163: PR084599 

Figure 2164: DNA328873, NP.057041.1, 218698jat 
Figure 2165: PRO84600 

Figure 2166: DNA324621, NP-054754.1, 218705-S_at 

Figure 2167: PRO 1285 

Figure 2168: DNA328874, NP^054778.1, 218723^^t 
Figure 2169: PRO84601 

Figure 2170: DNA328875, NP^064554.2, 218729jat 
Figure 2171: PRO84602 

Figure 2172: DNA328876, NP„060582.1, 218772 jc^t 
Figure 2173: PRO84603 

Figure 2174: DNA328877, BC020507, 218821^t 
Figure 2175: PRO84604 

Figure 2176: DNA328878, NP_060104.1, 218823-S_at 
Figure 2177: PRO84605 

Figure 2178: DNA328879, NP^064570.1, 218845^t 
Figure 2179: PRO84606 

Figure 2180: DNA227367, NP.062456.1. 218853^_at 
Figure 2181: PRO37830 

Figure 2182: DNA327872, NP-.057713.1, 218856^t 
Figure 2183: PR083812 

Figure 2184: DNA328880, NP„060369.1, 218872^t 
Figure 2185: PRO84607 

Figure 2186: DNA328881, NP_057706.1, 218890_?c_at 
Figure 2187: PR049469 

Figure 2188: DNA287166, NP„055129.1, 218943js_at 
Figure 2189: PR069459 

Figure 2190: DNA328882, NP_109589.1, 218967^^t 
Figure 2191: PR061822 

Figure 2192: DNA32721 1, NP-075053.1, 218989-K.at 
Figure 2193: PRO71052 

Figure 2194: DNA255929, NP_060935.1, 218992^t 
Figure 2195: PRO50982 

Figure 2196: DNA328883, NP-037474.1, 218996^t 
Figure 2197: PRO84608 

Figure 2198: DNA227194, FUllOOO, 218999>at 
Figure 2199: PR037657 

Figure 2200: DNA328884, NP_054884.1, 219006_at 
Figure 2201: PRO84609 

Figure 2202: DNA227187,NP_057703.1, 219014-at 
Figure 2203: PRO37650 

Figure 2204: DNA328885, NP»061 108.2, 219017_at 

Figure 2205: PRO50294 

Figure 2206A-B: DNA255239, NP.004832.1, 

219026js_at 
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Figure 2207: PRO503 16 

Figure 2208: DNA328886, NP-078811.1, 219040jat 
Figure 2209: PRO84610 ' 

Figure 2210: DNA328887, NP-061907.1, 219045^t 

Figure 2211:PR084611 

Figure 2212: DNA328888, NP_060436.1, 219053^_at 
Figure 2213: PR0846 12 

Figure 2214: DNA328889, NP-006005,1, 219061^^t 
Figure 2215: PR084613 

Figure 2216: DNA328890, NP-060403.1, 219093^t 
Figure 2217: PR084614 

Figure 2218: DNA327877, NP-065108.1, 219099 

Figure 2219: PR083816 

Figure 2220: DNA328891, NP-060263.1, 219143^_at 
Figure 2221: PR084615 

Figure 2222: DNA210216, NP-006860.1, 219150jS-^t 
Figure 2223: PR033752 

Figure 2224: DNA328892, NP-067643.2, 219165jat 

Figure 2225: PR084616 

Figure 2226A-B: DNA328893, NP^065699.1, 

219201_s„at 

Figure 2227: PR09914 

Figure 2228: DNA287235, NP_060598.1, 219204 js-at 
Figure 2229: PR069514 

Figure 2230: DNA225594, NP..037404.1, 219229 

Figure 2231: PRO36057 

Figure 2232: DNA328894, NP-060796.1, 219243^t 
Figure 2233: PR084617 

Figure 2234: DNA328895, NP-071762.2, 219259^t 
Figure 2235: PRO 13 17 

Figure 2236: DNA328896, NP-079037.1, 219265^t 
Figure 2237: PR084618 

Figure 2238: DNA328897, TRPV2, 219282_5-at 
Figure 2239: PR012382 

Figure 2240: DNA273489, NP>-055210,1, 219290-x_at 

Figure 2241: PR061472 

Figure 2242A-B: DNA328898, NP.060261.1, 

219316-5.at 

Figure 2243: PR084619 

Figure 2244: DNA328899, NP^061024.1, 219326^.at 

Figure 2245: PRO84620 

Figure 2246A-B: DNA255889, NP_061764.1, 

219340_s_at 

Figure 2247: PRO50942 

Figure 2248: DNA328900, NP_060814.1, 219344^1 
Figure 2249: PR084621 

Figure 2250: DNA254518, NP_057354.1, 219371^_at 
Figure 2251: PR049625 

Figure 2252: DNA188342, NP^064510.1, 219385^t 

Figure 2253: PR021718 

Figure 2254: DNA256417, NP-077271.1, 219402^_at 

Figure 2255: PR051457 

Figure 2256A-B: DNA327887, NP-006656.1, 

219403^_at 

Figure 2257: PR083823 

Figure 2258: DNA327888,NP-07 1732.1, 219412^1 



Figure 2259: PR083824 

Figure 2260: DNA328901, FU20533, 219449 jslat 
Figure 2261: PR084622 

Figure 2262: DNA328902, NP_071750.1, 219452jat 

Figure 2263: PR084623 

Figure 2264: DNA328903, NP-002805.1, 219485^ jai 
Figure 2265: PR084624 

Figure 2266: DNA328904, NP.076941.1, 219491 jat 

Figure 2267: PR084625 

Figure 2268A-B: DNA328905, NP.075392,1, 

219496^t 

Figure 2269: PR084626 

Figure 2270: DNA328906, NP-078855.1, 219506-at 
Figure 227 1:PR084627 

Figure 2272: DNA328907, NP-000067.1, 219534 Jt^t 

Figure 2273: PR084628 

Figure 2274: DNA328908, 7691567.2, 219540^t 
Figure 2275: PR084629 

Figure 2276: DNA225636, NP.065696.1, 219557-S-at 

Figure 2277: PRO36099 

Figure 2278A-B: DNA328909, NP.078800.2, 

219558^t 

Figure 2279: PRO84630 

Figure 2280: DNA328910, NP.057666.1, 219593 
Figure 2281: PR038848 

Figure 2282: DNA3 28911, MS4A4A, 219607^.at 
Figure 2283: PR084631 

Figure 2284: DNA328912, NP-.060287.1, 219622-at 
Figure 2285: PR084632 

Figure 2286: DNA328913, NP.079213.1, 21963 l^t 
Figure 2287: PR084633 

Figure 2288: DNA328914, NP^060883.1, 219634^t 
Figure 2289: PR036664 , 

Figure 2290: DNA327892, NP-060470.1, 219648-^t 
Figure 2291: PR083828 

Figure 2292: DNA328915, NP-055056.2, 219654»at 
Figure 2293: PR084634 

Figure 2294: DNA228002, NP.071744.1, 219666^t 
Figure 2295: PR038465 

Figure 2296: DNA328916, NP.071932.1, 219678 jc^t 
Figure 2297: PR084635 

Figure 2298: DNA287206, NP^060124.1, 219691^t 
Figure 2299: PR069488 

Figure 2300: DNA328917, NP-061838.1, 219725^t 
Figure 2301: PRO7306 

Figure 2302: DNA328918, NP„078935.1, 219770^t 
Figure 2303: PR084636 

Figure 2304: DNA328919, NP».078987.1, 219777-at 
Figure 2305: PR084637 

Figure 2306: DNA227152, NP-038467.1, 219788^t 
Figure 2307: PR037615 

Figure 2308: DNA328920, NP_061 129.1, 219837^„at 
Figure 2309: PR04425 

Figure 2310: DNA256033, NP„060164.1, 219858_S-at 
Figure 2311: PRO51081 

Figure 2312: DNA254838, NP^078904,1, 219874-at 
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Figure 2313: PR049933 

Figure 2314: DNA328921, NP-057079.1, 219878 js^t 
Figure 2315: PR084638 

Figure 2316: DNA256325, NP-005470.1, 219889-at 

Figure 2317: PR051367 

Figure 2318: DNA328922, NP-.037384.1, 219890^1 
Figure 23 19: PR084639 

Figure 2320: DNA328923, NP-.075379.1, 219892_at 
Figure 2321: PRO84640 

Figure 2322: DNA256608, NP.060408.1. 219895^t 
Figure 2323: PR051611 

Figure 2324: DNA328924, NP-.057150.2, 219933^t 

Figure 2325: PR084641 

Figure 2326: DNA255456, NP^057268.1, 219947^t 
Figure 2327: PRO50523 

Figure 2328: DNA227804, NP-065394.1, 219952 js^t 
Figure 2329: PROS 8267 

Figure 2330: DNA328925, NP-076403.1, 220005^t 
Figure 2331: PR084642 

Figure 2332: DNA256467, NP_079054,1, 220009-at 

Figure 2333: PRO51504 

Figure 2334A-B: DNA292946, NP-061 160.1, 

220023^t 

Figure 2335: PRO70613 

Figure 2336: DNA171414, NP.009130.1, 220034^t 
Figure 2337: PRO20142 

Figure 2338: DNA328926, NP_064703.1, 220046-s^t 

Figure 2339: PR084643 

Figure 2340A-B: DNA221079, NP-,071445.1, 

220066-at 

Figure 2341: PR034753 

Figure 2342: DNA256091, NP>.071385.1, 220094_s-at 
Figure 2343: PR051141 

Figure 2344: DNA328927, NP.078993.2, 220122^t 
Figure 2345: PR084644 

Figure 2346: DNA328928, NP-068377,1, 220178^t 

Figure 2347: PR084645 

Figure 2348: DNA324716, NP_463459.1, 220I89^-at 
Figure 2349: PROS 1347 

Figure 2350: DNA228059, NP„073742.1, 220199-s.at 
Figure 2351: PR038522 

Figure 2352: DNA328929, NP_060375.1, 220240-s_at 

Figure 2353: PR084646 

Figure 2354A-B: DNA328930, NP-038465.1, 

220253 _s_at 

Figure 2355: PR023525 

Figure 2356: DNA328931, NP-004226.1, 220266^_at 
Figure 2357: PR084647 

Figure 2358: DNA328932, NP-079057.1, 220301 
Figure 2359: PR084648 

Figure 2360: DNA328933, NP.057466.1, 220307^t 
Figure 2361: PR09891 

Figure 2362: DNA256735, NP-060175.1, 220333-at 
Figure 2363: PR051669 

Figure 2364A-B: DNA328934, EML4, 220386jS-at 
Figure 2365: PR084649 



Figure 2366: DNA328935, NP_009002.1, 220387-S_at 
Figure 2367: PRO84650 

Figure 2368: DNA254861, MCOLN3, 220484-at 
Figure 2369: PR049953 

Figure 2370: DNA328936, NP-066998.1, 220491 jat 
Figure 2371: PRO 1003 

Figure 2372: DNA328937, PHEMX, 220558 jx^at 
Figure 2373: PRO12380 

Figure 2374: DNA328938, NP^060617.1, 220643js_at 
Figure 2375: PR084651 

Figure 2376: DNA323756, NP-057267.2, 220688-S^t 
Figure 2377: PRO80512 

Figure 2378: DNA328939, NP-008834.1, 220741^^t 
Figure 2379: PR084652 

Figure 2380: DNA288247, NP^478059.1, 220892^-.at 
Figure 2381: PRO70011 

Figure 2382: DNA328940, NP_078893.1, 220933-S_at 
Figure 2383: PR084653 

Figure 2384: DNA328941, NP-055218.2, 220937-s^t 
Figure 2385: PR084654 

Figure 2386: DNA327953, NP.055182.2, 220942jc_at 
Figure 2387: PR083878 
Figure 2388A-B: DNA323882, NP-000692.2, 
220948 ^_at 

Figure 2389: PRO80625 

Figure 2390: DNA327917, NP_1 12240.1, 220966 JC^t 
Figure 2391: PR083852 

Figure 2392: DNA328942, NP.l 12216.2, 220985^-at 
Figure 2393: PR084655 

Figure 2394: DNA328943, NP_036566.1, 221041-S_at 
Figure 2395: PRO51680 

Figure 2396: DNA328944, NP..060554.1, 221078^_at 
Figure 2397: PR084656 

Figure 2398: DNA328945, NP»079177.2, 221081 js_at 
Figure 2399: PR084657 

Figure 2400: DNA328946, NP-055 164.1, 221087_s_at 
Figure 2401: PR012343 

Figure 2402: DNA328947, NP„055245.1, 221 188^_at 
Figure 2403: PR084658 

Figure 2404: DNA257293, NP_1 10396. 1, 221210.^^1 
Figure 2405: PR051888 

Figure 2406: DNA327920, NP„1 10431.1, 221245^-at 

Figure 2407: PR083855 

Figure 2408A-C: DNA328287, NP-072174.2, 

221246jc_at 

Figure 2409: PR084163 

Figure 2410: DNA328948, NP_110437.1, 221253js^t 
Figure 241 1 : PR084659 

Figure 2412: DNA256432, NP„1 10415.1, 221266^_.at 
Figure 2413: PR051471 

Figure 2414: DNA328027, NP^l 12570.2, 221437^-at 

Figure 2415: PR083944 

Figure 2416A-B: DNA272014, AF084555, 

221482_s^at 

Figure 2417: PRO60289 

Figure 2418: DNA328949, AF157510. 221487^^t 
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Figure 2419: PRO84660 

Figure 2420: DNA328950, NP»057025,1, 221504-S^t 

Figure 2421: PR084661 

Figure 2422A-B: DNA328951, HSM802232, 

221523^_at 

Figure 2423: PR084662 

Figure 2424: DNA328952, NP^067067.1, 221524^^t 

Figure 2425: PR084663 

Figure 2426A-B: DNA273901, NP^l 10389.1, 

221530_s-at 

Figure 2427: PR061855 

Figure 2428: DNA274676, DKFZp564A176Homo, 

221538^_at 

Figure 2429: DNA328953, NP-004086.1, 221539-at 

Figure 2430: PRO70296 

Figure 2431 A-B: DNA328954, NP-.1 13664.1, 

221541_at 

Figure 2432: PR09851 

Figure 2433 A-B: DNA269992, HUMACYLCOA, 
221561^t 

Figure 2434: PR058388 

Figure 2435: DNA328955, NP_054887.1, 221570^_at 
Figure 2436: PR084664 

Figure 2437A-B: DNA328956, AFl 10908, 221571 _at 
Figure 2438: DNAl 88321, NP^004855.1, 221577^-at 
Figure 2439: PR021896 

Figure 2440: DNA328957, WBSCR5, 221581^-at 
Figure 2441: PR023859 

Figure 2442: DNA328958, BC001663, 221593^_at 
Figure 2443: PR084665 

Figure 2444: DNA328959, NP^077027.1, 221620^_at 
Figure 2445: PRO4302 

Figure 2446: DNA254777, NP-055140.1, 221676_s^t 
Figure 2447: PR049875 

Figure 2448: DNA327526, NP-065727.2, 221679^_at 
Figure 2449: PR083574 

Figure 2450: DNA328960, NP-076426.1, 221692js_at 
Figure 2451: PR084666 

Figure 2452: DNA327929, AK001785, 221748^-at 
Figure 2453: PR083861 

Figure 2454: DNA328961, NP.4431 12.1, 221756-at 

Figure 2455: PR084667 

Figure 2456: DNA328962, BC021574, 221759:at 
Figure 2457: PR082746 

Figure 2458A~B: DNA328963, 328765.9, 221760^t 
Figure 2459: PR084668 

Figure 2460A-B: DNA327930, 1455324.9, 221765^t 
Figure 2461: PR083862 

Figure 2462: DNA328964, AK056028, 221770^t 

Figure 2463: PR084669 

Figure 2464A-C: DNA328965, AB051505, 221778^t 
Figure 2465A-B: DNA328966, BAB14908.1, 
221790^_at 



Figure 2466: PRO84670 

Figure 2467: DNA328967, BC017905, 221815^t 
Figure 2468: PR084671 

Figure 2469: DNA274058, NP.057203.1, 22I816^_at 
Figure 2470: PR061999 

Figure 2471A-B: DNA328968, 1322249.6, 221830^t 
Figure 2472: PR06251 1 

Figure 2473: DNA272419, AF105036, 221841_S-at 
Figure 2474: PRO60672 

Figure 2475: DNA299882, DNA299882, 221872-at 
Figure 2476: PRO70856 
Figure 2477: DNA328969, 334394.2, 221 878 
Figure 2478: PR084672 

Figure 2479: DNA327933, 1452741.11, 221 899^t 
Figure 2480: PR083865 

Figure 2481: DNA328970, NP_057696.1, 221920^_at 

Figure 2482: PR084673 

Figure 2483: DNA328971, AK000472, 221923 .S-at 
Figure 2484: PR084674 

Figure 2485: DNA254787, AK023140, 221935^^t 
Figure 2486: PR049885 

Figure 2487: DNA3271 14, NP-006004.1, 221989^t 
Figure 2488: PR062466 

Figure 2489: DNA328972, BC009950, 222001_x^t 
Figure 2490: DNA328973, NP_1 15538.1, 222024^„at 
Figure 2491: PR082497 

Figure 2492: DNAl 19482, DNAl 19482, 222108>at 
Figure 2493: PRO9850 

Figure 2494: DNA328974, NP„061893,1, 222116-S^t 
Figure 2495: PR084676 

Figure 2496: DNA287209, NP-056350.1, 222154_s.-at 
Figure 2497: PRO69490 

Figure 2498: DNA328975, NP.078807.1, 222155_s_at 
Figure 2499: PR047688 

Figure 2500: DNA328976, BC019091, 222206^-at 
Figure 2501: PR084677 

Figure 2502: DNA256784, NP-075069.1, 222209-S-at 
Figure 2503: PR051716 

Figure 2504: DNA328977, NP-071344,1, 222216js_at 
Figure 2505: PR084678 

Figure 2506: DNA328978, NP.060373.1, 222244_S-at 
Figure 2507: PR084679 

Figure 2508A-B: DNA328979, 006242.19, 222266^t 
Figure 2509: PRO84680 

Figure 2510: DNA328980, 7692031.1, 222273^t 
Figure 2511: PR084681 

Figure 2512: DNA328981, AF443871, 222294^^t 
Figure 2513: PR024633 

Figure 2514: DNA328982, 154391.1,222313^1 

Figure 2515: PR084682 

Figure 2516: DNA328983, 206335.1, 222366^t 
Figure 2517: PR084683 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
I. Definitions 

The tenns "PRO polypeptide" and "PRO" as used herein and when immediately followed by a 
5 numerical designation refer to various polypeptides, wherein the complete designation (i.e., PRO/number) 
refers to specific polypeptide sequences as described herein. The terms "PRO/number polypeptide" and 
"PRO/number" wherein the term "number" is provided as an actual numerical designation as used herein 
encompass native sequence polypeptides and polypeptide variants (which are further defbied herein). The 
PRO polypeptides described herein may be isolated from a variety of sources, such as from human tissue 

10 types or from another source, or prepared by recombinant or synthetic methods. The term "PRO 

polypeptide" refers to each individual PRO/number polypeptide disclosed herein. All disclosures in this 
specification which refer to the "PRO polypeptide" refer to each of the polypeptides individually as well as 
jointly. For example, descriptions of the preparation of, purification of, derivation of, formation of 
antibodies to or against, administration of, compositions containing, treatment of a disease with, etc., pertain 

15 to each polypeptide of the invention individually. The term "PRO polypeptide" also includes variants of the 
PRO/number polypeptides disclosed herein. 

A "native sequence PRO polypeptide" comprises a polypeptide having the same amino acid 
sequence as the corresponding PRO polypeptide derived from nature. Such native sequence PRO 
polypeptides can be isolated from nature or can be produced by recombinant or synthetic means. The term 

20 "native sequence PRO polypeptide" specifically encompasses naturally-occurring truncated or secreted 

forms of the specific PRO polypeptide (e.g., an extracellular domain sequence), naturally-occurring variant 
forms (e.g., alternatively spliced forms) and naturally-occurring allelic variants of the polypeptide. In 
various embodiments of the invention, the native sequence PRO polypeptides disclosed herein are mature or 
full-length native sequence polypeptides comprising the Ml-length amino acids sequences shown in the 

25 accompanying figures. Start and stop codons are shown in bold font and underliaed in tlie figures. 

However, while the PRO polypeptide disclosed in the accompanying figures are shown to begin with 
methionine residues designated herein as amino acid position 1 in the figures, it is conceivable and possible 
that other methionine residues located either upstream or downstream from the amino acid position 1 in the 
figures may be employed as the starting amino acid residue for the PRO polypeptides. 

30 The PRO polypeptide "extracellular domain" or '*ECD" refers to a form of the PRO polypeptide 

which is essentially free of the transmembrane and cytoplasmic domains. Ordinarily, a PRO polypeptide 
ECD will have less than 1% of such transmembrane and/or cytoplasmic domains and preferably, will have 
less than 0.5% of such domains. It will be understood tliat any transmembrane domains identified for the 
PRO polypeptides of the present iavention are identified pursuant to criteria routinely employed in tlie art for 

35 identifying that type of hydrophobic domain. The exact boundaries of a transmembrane domain may vary 

but most likely by no more than about 5 amino acids at either end of the domain as initially identified herein. 
Optionally, therefore, an extracellular domain of a PRO polypeptide may contain from about 5 or fewer 
amino acids on either side of the transmembrane domain/extracellular domain boundary as identified in the 
Examples or specification and such polypeptides, with or without the associated signal peptide, and nucleic 

40 acid encoding them, are contemplated by the present invention. 
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The approximate location of the "signal peptides" of the various PRO polypeptides disclosed herein 
are shown in the present specification and/or the accompanying figures. It is noted, however, that the C- 
terminal boundary of a signal peptide may vary, but most likely by no more than about 5 amino acids on 
either side of the signal peptide C-terminal boundary as initially identified herein, wherein the C-terminal 
boundary of the signal peptide may be identified pursuant to criteria routinely employed in the art for 
identifying that type of amino acid sequence element (e.g., Nielsen et aL, Prot. Eng. 10:1-6 (1997) and von 
Heinje et al., NucL Acids. Res. 14:4683-4690 (1986)). Moreover, it is also recognized that, in some cases, 
cleavage of a signal sequence fi:om a secreted polypeptide is not entirely uniform, resulting in more than one 
secreted species. These mature polypeptides, where the signal peptide is cleaved within no more than about 
5 amino acids on either side of the C-tenninal boundary of the signal peptide as identified herein, and the 
polynucleotides encoding them, are contemplated by the present invention. 

"PRO polypeptide variant" means an active PRO polypeptide as defined above or below having at 
least about 80% amino acid sequence identity with a full-length native sequence PRO polypeptide sequence 
as disclosed herein, a PRO polypeptide sequence lacking the signal peptide as disclosed herein, an 
extracellular domain of a PRO polypeptide, with or witliout the signal peptide, as disclosed herein or any 
other firagment of a fiilHength PRO polypeptide sequence as disclosed herein. Such PRO polypeptide 
variants include, for instance, PRO polypeptides wherein one or more amino acid residues are added, or 
deleted, at the N- or C-terminus of the full-length native amino acid sequence. Ordinarily, a PRO 
polypeptide variant will have at least about 80% amino acid sequence identity, alternatively at least about 
81% amino acid sequence identity, altematively at least about 82% amino acid sequence identity, 
alternatively at least about 83% amino acid sequence identity, altematively at least about 84% amino acid 
sequence identity, altematively at least about 85% amino acid sequence identity, altematively at least about 
86% amino acid sequence identity, altematively at least about 87% amino acid sequence identity, 
altematively at least about 88% amino acid sequence identity, altematively at least about 89% amino acid 
sequence identity, altematively at least about 90% amino acid sequence identity, altematively at least about 
91% amino acid sequence identity, alternatively at least about 92% amino acid sequence identity, 
altematively at least about 93% amino acid sequence identity, altematively at least about 94% amino acid 
sequence identity, altematively at least about 95% amino acid sequence identity, altematively at least about 
96% amino acid sequence identity, alternatively at least about 97% amino acid sequence identity, 
altematively at least about 98% amino acid sequence identity and altematively at least about 99% amino 
acid sequence identity to a fiiU-length native sequence PRO polypeptide sequence as disclosed herein, a 
PRO polypeptide sequence lacking the signal peptide as disclosed herein, an extracellular domain of a PRO 
polypeptide, with or without the signal peptide, as disclosed herein or any other specifically defined 
fi-agment of a full-length PRO polypeptide sequence as disclosed herein. Ordinarily, PRO variant 
polypeptides are at least about 10 amino acids in length, altematively at least about 20 amino acids in length, 
altematively at least about 30 amino acids in length, altematively at least about 40 amino acids in length, 
altematively at least about 50 amino acids in length, altematively at least about 60 amino acids in length, 
altematively at least about 70 amino acids in length, altematively at least about 80 amino acids in length, 
altematively at least about 90 amino acids in length, altematively at least about 100 amino acids in length. 
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alternatively at least about 150 amino acids in length, alternatively at least about 200 amino acids in length, 
alternatively at least about 300 amino acids in length, or more. 

"Percent (%) amino acid sequence identity" with respect to the PRO polypeptide sequences 
identified herein is defined as the percentage of amino acid residues in a candidate sequence that are 
5 identical with the amino acid residues in the specific PRO polypeptide sequence, after aligning the 

sequences and introducing gaps, if necessary, to achieve the maximum percent sequence identity, and not 
considering any conservative substitutions as part of the sequence identity. Aligimient for purposes of 
determining percent amino acid sequence identity can be achieved in various ways that are within the skill in 
the art, for instance, using publicly available computer software such as BLAST, BLAST-2, ALIGN or 

10 Megalign (DNASTAR) software. Those skilled in the art can determine appropriate parameters for 

measuring alignment, including any algorithms needed to achieve maximal alignment over the full length of 
the sequences being compared. For purposes herein, however, % amino acid sequence identity values are 
generated using the sequence comparison computer program ALIGN-2, wherein the complete source code 
for the ALIGN-2 program is provided in Table 1 below. The ALIGN-2 sequence comparison computer 

15 program was authored by Genentech, Inc. and the source code shown in Table 1 below has been filed with 
user documentation in the U.S. Copyright Office, Washington D.C., 20559, where it is registered under U.S. 
Copyright Registration No. TXU5 10087. The ALIGN-2 program is publicly available through Genentech, 
Inc., South San Francisco, California or may be compiled from the source code provided in Table 1 below. 
The ALIGN-2 program should be compiled for use on a UNIX operating system, preferably digital UNIX 

20 V4.0D. All sequence comparison parameters are set by the ALIGN-2 program and do not vary. 

In situations where ALIGN-2 is employed for amino acid sequence comparisons, the % amiino acid 
sequence identity of a given amino acid sequence A to, with, or against a given amino acid sequence B 
(which can alternatively be phrased as a given amino acid sequence A that has or comprises a certain % 
amino acid sequence identity to, with, or against a given amino acid sequence B) is calculated as follows: 

25 

100 times the fraction X/Y i 

where X is the number of amino acid residues scored as identical matches by the sequence alignment 
program ALIGN-2 in that program's alignment of A and B, and where Y is the total number of amino acid 

30 residues in B. It will be appreciated that where the length of amino acid sequence A is not equal to the 

length of amino acid sequence B, the % amino acid sequence identity of A to B will not equal the % amino 
acid sequence identity of B to A. As examples of % amino acid sequence identity calculations using this 
method, Tables 2 and 3 demonstrate how to calculate the % amino acid sequence identity of the amino acid 
sequence designated "Comparison Protein" to the amino acid sequence designated "PRO", wherein "PRO" 

35 represents the amino acid sequence of a hypothetical PRO polypeptide of interest, "Comparison Protein" 
represents tlie amino acid sequence of a polypeptide against which the "PRO" polypeptide of interest is 
being compared, and "X, " Y" and "Z" each represent different hypothetical amino acid residues. 

Unless specifically stated otherwise, all % amino acid sequence identity values used herein are 
obtained as described in the immediately preceding paragraph using the ALIGN-2 computer program. 

40 However, % amino acid sequence identity values may also be obtained as described below by using the WU- 
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BLAST-2 computer program (Altschul et al., Methods in Enzvmology 266:460-480 (1996)). Most of the 
WU-BLAST-2 search parameters are set to the default values. Those not set to default values, i.e., the 
adjustable parameters, are set with the following values: overlap span = 1, overlap fraction = 0.125, word 
threshold (T) = 1 1, and scoring matrix = BLOSUM62. When WU-BLAST-2 is employed, a % amino acid 
5 sequence identity value is determined by dividing (a) the number of matching identical amino acid residues 
between the amino acid sequence of the PRO polypeptide of interest having a sequence derived jfrom the 
native PRO polypeptide and the comparison amino acid sequence of interest (i.e., the sequence against 
which the PRO polypeptide of interest is being compared which may be a PRO variant polypeptide) as 
determined by WU-BLAST-2 by (b) the total number of amino acid residues of the PRO polypeptide of 

10 interest. For example, in the statement "a polypeptide comprising an the amino acid sequence A which has 
or having at least 80% amino acid sequence identity to the amino acid sequence B", the amino acid sequence 
A is the comparison amino acid sequence of interest and the amino acid sequence B is the amino acid 
sequence of the PRO polypeptide of iaterest. 

Percent amino acid sequence identity may also be determiaed using the sequence comparison 

15 program NCBI-BLAST2 (Altschul et al.. Nucleic Acids Res. 25:3389-3402 (1997)). The NCBI-BLAST2 

sequence comparison program may be downloaded from http://www.ncbi.nlm.nih.gov or otherwise obtained 
from the National Institute of Health, Bethesda, MD. NCBI-BLAST2 uses several search parameters, 
wherein all of those search parameters are set to default values including, for example, xmmask = yes, strand 
= all, expected occurrences =10, minimum low complexity lei^th = 15/5, multi-pass e-value = 0.01, 

20 constant for multi-pass = 25, dropoff for final gapped alignment = 25 and scoring matrix = BLOSUM62. 

In situations where NCBI-BLAST2 is employed for amino acid sequence comparisons, the % 
amino acid sequence identity of a given amino acid sequence A to, with, or against a given amino acid 
sequence B (which can alternatively be phrased as a given amino acid sequence A that has or comprises a 
certain % amino acid sequence identity to, with, or against a given amino acid sequence B) is calculated as 

25 follows: 

100 times the fraction XA^ 

where X is the number of amino acid residues scored as identical matches by the sequence alignment 
30 program NCBI-BLAST2 in that program's alignment of A and B, and where Y is the total number of amino 
acid residues in B. It will be appreciated that where the length of amino acid sequence A is not equal to tlie 
length of amino acid sequence B, the % amino acid sequence identity of A to B will not equal the % amino 
acid sequence identity of B to A. 

"PRO variant polynucleotide" or "PRO variant nucleic acid sequence" means a nucleic acid 
35 molecule which encodes an active PRO polypeptide as defined below and which has at least about 80% 

nucleic acid sequence identity with a nucleotide acid sequence encoding a full-length native sequence PRO 
polypeptide sequence as disclosed herein, a fuU-length native sequence PRO polypeptide sequence lacldng 
the signal peptide as disclosed herein, an extracellular domain of a PRO polypeptide, with or without the 
signal peptide, as disclosed herein or any other fragment of a full-length PRO polypeptide sequence as 
40 disclosed herein. Ordinarily, a PRO variant polynucleotide will have at least about 80% nucleic acid 
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sequence identity, alternatively at least about 81% nucleic acid sequence identity, alternatively at least about 
82% nucleic acid sequence identity, alternatively at least about 83% nucleic acid sequence identity, 
alternatively at least about 84% nucleic acid sequence identity, alternatively at least about 85% nucleic acid 
sequence identity, alternatively at least about 86% nucleic acid sequence identity, alternatively at least about 
5 87% nucleic acid sequence identity, alternatively at least about 88% nucleic acid sequence identity, 

alternatively at least about 89% nucleic acid sequence identity, alternatively at least about 90% nucleic acid 
sequence identity, alternatively at least about 91% nucleic acid sequence identity, alternatively at least about 
92% nucleic acid sequence identity, alternatively at least about 93% nucleic acid sequence identity, 
alternatively at least about 94% nucleic acid sequence identity, alternatively at least about 95% nucleic acid 
10 sequence identity, alternatively at least about 96% nucleic acid sequence identity, alternatively at least about 
97% nucleic acid sequence identity, alternatively at least about 98% nucleic acid sequence identity and 
alternatively at least about 99% nucleic acid sequence identity with a nucleic acid sequence encoding a full- 
length native sequence PRO polypeptide sequence as disclosed herein, a full-length native sequence PRO 
polypeptide sequence lacking the signal peptide as disclosed herein, an extracellular domain of a PRO 

15 polypeptide, with or without the signal sequence, as disclosed herein or any otlier fragment of a full-length 
PRO polypeptide sequence as disclosed herein. Variants do not encompass the native nucleotide sequence. 

Ordinarily, PRO variant polynucleotides are at least about 30 nucleotides in lengtli, alternatively at 
least about 60 nucleotides in length, alternatively at least about 90 nucleotides in length, alternatively at least 
about 120 nucleotides in length, alternatively at least about 150 nucleotides in length, alternatively at least 

20 about 1 80 nucleotides in length, alternatively at least about 210 nucleotides in length, alternatively at least 
about 240 nucleotides in length, alternatively at least about 270 nucleotides in length, alternatively at least 
about 300 nucleotides in lengtli, alternatively at least about 450 nucleotides in length, alternatively at least 
about 600 nucleotides in length, alternatively at least about 900 nucleotides in length, or more. 

"Percent (%) nucleic acid sequence identity" with respect to PRO-encoding nucleic acid sequences 

25 identified herein is defined as the percentage of nucleotides in a candidate sequence that are identical with 
the nucleotides in the PRO nucleic acid sequence of interest, after aligning the sequences and introducing 
gaps, if necessary, to achieve the maximum percent sequence identity. Alignment for purposes of 
determining percent nucleic acid sequence identity can be achieved in various ways that are within the skill 
in the art, for instance, using publicly available computer software such as BLAST, BLAST-2, ALIGN or 

30 Megalign (DNASTAR) software. For purposes herein, however, % nucleic acid sequence identity values are 
generated using the sequence comparison computer program ALIGN-2, wherein the complete source code 
for the ALIGN-2 program is provided in Table 1 below. The ALIGN-2 sequence comparison computer 
program was authored by Genentech, Inc. and the source code shown in Table 1 below has been filed with 
user documentation in the U.S. Copyright Office, Washington D.C., 20559, where it is registered under U.S. 

35 Copyright Registration No. TXU510087. The ALIGN-2 program is publicly available through Genentech, 
Inc., South San Francisco, California or may be compiled from the source code provided in Table 1 below. 
The ALIGN-2 program should be compiled for use on a UNIX operating system, preferably digital UNIX 
V4.0D. All sequence comparison parameters are set by the ALIGN-2 program and do not vary. 

In situations where ALIGN-2 is employed for nucleic acid sequence comparisons, the % nucleic 

40 acid sequence identity of a given nucleic acid sequence C to, with, or against a given nucleic acid sequence 
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D (which can alternatively be phrased as a given nucleic acid sequence C that has or comprises a certain % 
nucleic acid sequence identity to, with, or against a given nucleic acid sequence D) is calculated as follows: 

100 times the fraction W/Z 

5 

where W is the number of nucleotides scored as identical matches by the sequence alignment program 
ALIGN-2 in that program's alignment of C and D, and where Z is tibe total number of nucleotides in D. It 
will be appreciated that where the length of nucleic acid sequence C is not equal to the length of nucleic acid 
sequence the % nucleic acid sequence identity of C to D will not equal the % nucleic acid sequence 

10 identity of D to C. As examples of % nucleic acid sequence identity calculations. Tables 4 and 5, 

demonstrate how to calculate the % nucleic acid sequence identity of the nucleic acid sequence designated 
"Comparison DNA" to the nucleic acid sequence designated "PRO-DNA", wherein "PRO-DNA" represents 
a hypothetical PRO-encoding nucleic acid sequence of interest, "Comparison DNA" represents the 
nucleotide sequence of a nucleic acid molecule against which the "PRO-DNA" nucleic acid molecule of 

15 interest is being compared, and "N", "L" and "V" each represent different hypothetical nucleotides. 

Unless specifically stated otherwise, all % nucleic acid sequence identity values used herein are 
obtained as described in ihe immediately preceding paragraph using the ALIGN-2 computer program. 
However, % nucleic acid sequence identity values may also be obtained as described below by using the 
WU-BLAST-2 computer program (Altschul et al.. Methods in Enzvmologv 266:460-480 (1996)). Most of 

20 the WU-BLAST-2 search parameters are set to the default values. Those not set to default values, i.e., the 
adjustable parameters, are set with the following values: overlap span = 1, overlap fraction = 0.125, word 
threshold (T) = 1 1, and scoring matrix = BLOSUM62. When WU-BLAST-2 is employed, a % nucleic acid 
sequence identity value is determined by dividing (a) the number of matching identical nucleotides between 
the nucleic acid sequence of the PRO polypeptide-encoding nucleic acid molecule of interest having a 

25 sequence derived from the native sequence PRO polypeptide-encoding nucleic acid and the comparison 
nucleic acid molecule of interest (i.e., the sequence against which the PRO polypeptide-encoding nucleic 
acid molecule of interest is being compared which may be a variant PRO polynucleotide) as determined by 
WU-BLAST-2 by (b) the total number of nucleotides of the PRO polypeptide-encoding nucleic acid 
molecule of interest. For example, in the statement "an isolated nucleic acid molecule comprising a nucleic 

30 acid sequence A which has or having at least 80% nucleic acid sequence identity to the nucleic acid 

sequence B", the nucleic acid sequence A is the comparison nucleic acid molecule of interest and the nucleic 
acid sequence B is the nucleic acid sequence of the PRO polypeptide-encoding nucleic acid molecule of 
interest. 

Percent nucleic acid sequence identity may also be determined using the sequence comparison 
35 program NCBI-BLAST2 (Altschul et al.. Nucleic Acids Res. 25:3389-3402 (1997)). The NCBI-BLAST2 

sequence comparison program may be downloaded from http://www.ncbi.nlm.nih.gov or otherwise obtained 
from the National Institute of Health, Bethesda, MD. NCBI-BLAST2 uses several search parameters, 
wherein all of those search parameters are set to default values includiog, for example, unmask = yes, strand 
= all, expected occurrences = 10, minimum low complexity length = 15/5, multi-pass e-value = 0.01, 
40 constant for multi-pass = 25, dropoff for final gapped alignment = 25 and scoring matrix = BLOSUM62. 
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In situations where NCBI-BLAST2 is employed for sequence comparisons, the % nucleic acid 
sequence identity of a given nucleic acid sequence C to, with, or against a given nucleic acid sequence D 
(which can alternatively be phrased as a given nucleic acid sequence C that has or comprises a certain % 
nucleic acid sequence identity to, with, or against a given nucleic acid sequence D) is calculated as follows: 

5 

100 tunes the fraction W/Z 

where W is the number of nucleotides scored as identical matches by the sequence alignment program 
NCBI-BLAST2 in that program's alignment of C and D, and where Z is the total number of nucleotides in 

10 D. It will be appreciated that where the length of nucleic acid sequence C is not equal to the length of 

nucleic acid sequence D, the % nucleic acid sequence identity of C to D will not equal the % nucleic acid 
sequence identity of D to C. 

In other embodiments, PRO variant polynucleotides are nucleic acid molecules that encode an 
active PRO polypeptide and which are capable of hybridizing, preferably under stringent hybridization and 

15 wash conditions, to nucleotide sequences encoding a full-length PRO polypeptide as disclosed herein. PRO 
variant polypeptides may be those that are encoded by a PRO variant polynucleotide. 

"Isolated," when used to describe the various polypeptides disclosed herein, means polypeptide that 
has been identified and separated and/or recovered from a component of its natural environment. 
Contaminant components of its natural environment are materials that would typically interfere with 

20 diagnostic or therapeutic uses for the polypeptide, and may include enzymes, hormones, and other 

proteinaceous or non-proteinaceous solutes. In preferred embodiments, the polypeptide will be purified (1) 
to a degree sufficient to obtain at least 15 residues of N-terminal or internal amino acid sequence by use of a 
spinning cup sequenator, or (2) to homogeneity by SDS-PAGE under non-reducing or reducing conditions 
using Coomassie blue or, preferably, silver stain. Isolated polypeptide includes polypeptide in situ within 

25 recombinant cells, since at least one component of the PRO polypeptide natural environment will not be 
present. Ordinarily, however, isolated polypeptide will be prepared by at least one purification step. 

An "isolated" PRO polypeptide-encoding nucleic acid or other polypeptide-encoding nucleic acid is 
a nucleic acid molecule that is identified and separated from at least one contaminant nucleic acid molecule 
with which it is ordinarily associated in the natural source of the polypeptide-encoding nucleic acid. An 

30 isolated polypeptide-encoding nucleic acid molecule is other than in the form or setting in which it is found 
in nature. Isolated polypeptide-encoding nucleic acid molecules therefore are distinguished from the 
specific polypeptide-encoding nucleic acid molecule as it exists in natural cells. However, an isolated 
polypeptide-encoding nucleic acid molecule includes polypeptide-encoding nucleic acid molecules 
contained in cells that ordinarily express the polypeptide where, for example, tlie nucleic acid molecule is in 

35 a chromosomal location different from that of natural cells. 

The term "control sequences" refers to DNA sequences necessary for the expression of an operably 
linked coding sequence in a particular host organism. The control sequences that are suitable for 
prokaryotes, for example, include a promoter, optionally an operator sequence, and a ribosome binding site. 
Eukaryotic cells are known to utilize promoters, polyadenylation signals, and enhancers. 
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Nucleic acid is "operably linked" when it is placed into a functional relationship with another 
nucleic acid sequence. For example, DNA for a presequence or secretory leader is operably linked to DNA 
for a polypeptide if it is expressed as a preproteui that participates in the secretion of the polypeptide; a 
promoter or enliancer is operably linked to a coding sequence if it affects the transcription of the sequence; 
5 or a ribosome binding site is operably linked to a coding sequence if it is positioned so as to facilitate 

translation. Generally, "operably linked" means that the DNA sequences being Hnked are contiguous, and, 
in the case of a secretory leader, contiguous and in reading phase. However, enhancers do not have to be 
contiguous. Linking is accomplished by ligation at convenient restriction sites. If such sites do not exist, the 
synthetic oligonucleotide adaptors or linkers are used in accordance with conventional practice. 

10 The term "antibody" is used in the broadest sense and specificaUy covers, for example, single anti- 

PRO monoclonal antibodies (including agonist, antagonist, and neutralizing antibodies), anti-PRO antibody 
compositions withpolyepitopic specificity, single chain anti-PRO antibodies, and fragments of anti-PRO 
antibodies (see below). The term "monoclonal antibody" as used herein refers to an antibody obtained firom 
a population of substantially homogeneous antibodies, i.e., the individual antibodies comprising the 

15 population are identical except for possible naturally-occurring mutations that may be present in minor 
amounts. 

"Stringency" of hybridization reactions is readily determinable by one of ordinary skill in the art, 
and generally is an empirical calculation dependent upon probe length, washing temperature, and salt 
concentration. In general, longer probes require higher temperatures for proper annealing, while shorter 

20 probes need lower temperatures. Hybridization generally depends on the ability of denatured DNA to 

reaimeal when complementary strands are present in an environment below their melting temperature. The 
higher the degree of desired homology between the probe and hybridizable sequence, the higher the relative 
temperature which can be used. As a result, it follows that higher relative temperatures would tend to make 
the reaction conditions more stringent, while lower temperatures less so. For additional details and 

25 explanation of stringency of hybridization reactions, see Ausubel et al., Current Protocols in Molecular 
Bioloev, Wiley Interscience Publishers, (1995). 

"Stringent conditions" or "high stringency conditions", as defined herein, may be identified by those 
that: (1) employ low ionic strength and high temperature for washing, for example 0,015 M sodium 
chloride/0.0015 M sodium citrate/0.1% sodium dodecyl sulfate at 50°C; (2) employ during hybridization a 

30 denaturing agent, such as formamide, for example, 50% (v/v) fonnamide with 0.1% bovine serum 

albumin/0.1% Ficoll/0.1% polyvinylpyrrolidone/50mM sodium phosphate buffer at pH 6.5 with 750 mM 
sodium chloride, 75 mM sodium citrate at 42°C; or (3) employ 50% formamide, 5 x SSC (0.75 M NaCl, 
0.075 M sodium citrate), 50 niM sodium phosphate (pH 6.8), 0.1% sodium pyrophosphate, 5 x Denhardt's 
solution, sonicated salmon sperm DNA (50 |xg/ml), 0.1% SDS, and 10% dextran sulfate at 42°C, with 

35 washes at 42°C in 0.2 x SSC (sodium chloride/sodiiim citrate) and 50% formamide at 55°C, followed by a 
high-stringency wash consisting of 0.1 x SSC containing EDTA at 55°C. 

"Moderately stringent conditions" may be identified as described by Sambrook et al., Molecular 
Cloning: A Laboratorv Manual, New York: Cold Spring Harbor Press, 1989, and include the use of washing 
solution and hybridization conditions (e.g., temperature, ionic strength and %SDS) less stringent that those 

40 described above. An example of moderately stringent conditions is overnight incubation at 3 7°C in a 
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solution comprising: 20% formamide, 5 x SSC (150 wM NaCl, 15 mM trisodium citrate), 50 niM sodium 
phosphate (pH 7.6), 5 x Denhardt's solution, 10% dextran sulfate, and 20 mg/ml denatured sheared salmon 
sperm DNA, followed by washing the filters in 1 x SSC at about 37-50°C. The skilled artisan will recognize 
how to adjust the temperature, ionic strength, etc. as necessary to accommodate factors such as probe length 
5 and the like. 

The term "epitope tagged" when used herein refers to a chimeric polypeptide comprising a PRO 
polypeptide fused to a "tag polypeptide". The tag polypeptide has enough residues to provide an epitope 
against which an antibody can be made, yet is short enough such that it does not interfere with activity of the 
polypeptide to which it is fused. The tag polypeptide preferably also is feirly unique so that the antibody 

10 does not substantially cross-react with other epitopes. Suitable tag polypeptides generally have at least six 
amino acid residues and usually between about 8 and 50 anodno acid residues (preferably, between about 10 
and 20 amino acid residues). 

As used herein, the term "immunoadhesin" designates antibody-like molecules which combine the 
binding specificity of a heterologous protein (an "adhesin") with the effector functions of immunoglobulin 

15 constant domains. Structurally, the immunoadhesins comprise a fusion of an amino acid sequence with the 
desired binding specificity which is other than the antigen recognition and binding site of an antibody (i.e., is 
"heterologous"), and an immunoglobulin constant domain sequence. The adhesin part of an immunoadhesin 
molecule typically is a contiguous amino acid sequence comprising at least the binding site of a receptor or a 
ligand. The immunoglobulin constant domain sequence in the immunoadhesin may be obtained jfrom any 

20 immunoglobulin, such as IgG-1, IgG-2, IgG-3, or IgG-4 subtypes, IgA (including IgA-1 and IgA-2), IgE, 
IgD or IgM. 

"Active" or "activity" for the purposes herein refers to form(s) of a PRO polypeptide which retain a 
biological and/or an immunological activity of native or naturally-occurring PRO, wherein "biological" 
activity refers to a biological function (either inhibitory or stimulatory) caused by a native or naturally- 
25 occurring PRO other than the ability to induce the production of an antibody against an antigenic epitope 
possessed by a native or naturally-occurring PRO and an "immimological" activity refers to the ability to 
induce the production of an antibody against an antigenic epitope possessed by a native or naturally- 
occurring PRO. 

The term "antagonist" is used in the broadest sense, and includes any molecule that partially or fully 
30 blocks, inhibits, or neutralizes a biological activity of a native PRO polypeptide disclosed herein. In a 

similar manner, the term "agonist" is used in the broadest sense and includes any molecule that mimics a 
biological activity of a native PRO polypeptide disclosed herein. Suitable agonist or antagonist molecules 
specifically include agonist or antagonist antibodies or antibody fragments, fragments or amino acid 
sequence variants of native PRO polypeptides, peptides, antisense oligonucleotides, small organic 
35 molecules, etc. Methods for identifying agonists or antagonists of a PRO polypeptide may comprise 

contacting a PRO polypeptide with a candidate agonist or antagonist molecule and measuring a detectable 
change in one or more biological activities normally associated with the PRO polypeptide. 

"Treatment" refers to both therapeutic treatment and prophylactic or preventative measures, 
wherein the object is to prevent or slow down (lessen) the targeted pathologic condition or disorder. Those 
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in need of treatment include those already with the disorder as well as those prone to have the disorder or 
those in whom the disorder is to be prevented. 

"Clironic" administration refers to administration of the agent(s) in a continuous mode as opposed 
to an acute mode, so as to maintain the initial therapeutic effect (activity) for an extended period of time. 
"Intermittent" administration is treatment that is not consecutively done without interruption, but rather is 
cyclic in nature. 

"Mammal" for purposes of treatment refers to any animal classified as a mammal, including 
humans, domestic and farm animals, and zoo, sports, or pet animals, such as dogs, cats, cattle, horses, sheep, 
pigs, goats, rabbits, etc. Preferably, the mammal is human. 

Administration "in combination with" one or more further therapeutic agents includes simultaneous 
(concurrent) and consecutive administration in any order. 

"Carriers" as used herein include pharmaceutically acceptable carriers, excipients, or stabilizers 
which are nontoxic to the cell or manmial being exposed thereto at the dosages and concentrations 
employed. Often the physiologically acceptable carrier is an aqueous pH buffered solution. Examples of 
physiologically acceptable carriers include buffers such as phosphate, citrate, and other organic acids; 
antioxidants including ascorbic acid; low molecular weight (less than about 10 residues) polypeptide; 
proteiias, such as serum albimiin, gelatin, or immunoglobulins; hydrophilic polymers such as 
polyvinylpyrrolidone; amino acids such as glycine, glutamine, asparagine, arginine or lysine; 
monosaccharides, disaccharides, and other carbohydrates including glucose, mamose, or dextrins; chelating 
agents such as EDTA; sugar alcohols such as mannitol or sorbitol; salt-forming counterions such as sodiiun; 
and/or nonionic surfactants such as TWEBN™, polyethylene glycol (PEG), and PLURONICS™. 

"Antibody fragments" comprise a portion of an intact antibody, preferably the antigen binding or 
variable region of the intact antibody. Examples of antibody fragments include Fab, Fab', F(ab')2, and Fv 
fragments; diabodies; linear antibodies (Zapata et al.. Proteta Eng. 8f 10^: 1057-1062 [1995]); single-chain 
antibody molecules; and multispecific antibodies formed from antibody fragments. 

Papain digestion of antibodies produces two identical antigen-binding fragments, called "Fab" 
fragments, each with a single antigen-binding site, and a residual "Fc" fragment, a designation reflecting the 
ability to crystallize readily. Pepsin treatment yields an F(ab*)2 fragment that has two antigen-combining 
sites and is still capable of cross-linking antigen. 

"Fv" is the minimum antibody fragment which contains a complete antigen-recognition and - 
binding site. This region consists of a dimer of one heavy- and one light-chain variable domain in tight, non- 
covalent association. It is in this configuration tliat the three CDRs of each variable domain interact to 
define an antigen-binding site on the surface of the Vh-Vl dimer. Collectively, the six CDRs confer antigen- 
binding specificity to the antibody. However, even a single variable domain (or half of an Fv comprising 
only three CDRs specific for an antigen) has the ability to recognize and bmd antigen, although at a lower 
afiSnity than the entire binding site. 

The Fab fragment also contains the constant domain of the light chain and the first constant domain 
(CHI) of the heavy chain. Fab fragments differ from Fab' fragments by the addition of a few residues at the 
carboxy terminus of the heavy chain CHI domain including one or more cysteines from the antibody hinge 
region. Fab -SH is the designation herein for Fab* in which the cysteine residue(s) of the constant domains 
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bear a free thiol group. F(ab')2 antibody fragments originally were produced as pairs of Fab* fragments 
which have hinge cysteines between them. Other chemical couplings of antibody fragments are also known. 

The "light chains" of antibodies (immunoglobulins) from any vertebrate species can be assigned to 
one of two clearly distract types, called kappa and lambda, based on the amiao acid sequences of their 
5 constant domains. 

DependtQg on the amino acid sequence of the constant domain of their heavy chains, 
immunoglobulins can be assigned to different classes. There are five major classes of immunoglobulins: 
IgA, IgD, IgE, IgG, and IgM, and several of these may be ftirther divided into subclasses (isotypes), e.g., 
IgGl, IgG2, IgG3, IgG4, IgA, and IgA2. 

10 "Siugle-chain Fv" or "sFv" antibody fragments comprise the Vh and Vl domains of antibody, 

wherein these domains are present in a single polypeptide chain. Preferably, the Fv polypeptide ftirther 
comprises a polypeptide linker between the Vh and Vl domains which enables the sFv to form the desired 
structure for antigen binding. For a review of sFv, see Pluckthun in The Pharmacology of Monoclonal 
Antibodies, vol. 1 13, Rosenburg and Moore eds., Springer-Verlag, New York, pp. 269-3 15 (1994). 

15 The term "diabodies" refers to small antibody fragments with two antigen-bindiug sites, which 

fragments comprise a heavy-chain variable domain (Vh) comiected to a light-chain variable domain (Vl) in 
the same polypeptide chain (Vh-Vl). By usmg a linker that is too short to allow pairing between the two 
domains on the same chain, the domains are forced to pair with the complementary domains of another chain 
and create two antigen-binding sites. Diabodies are described more ftilly in, for example, EP 404,097; WO 

20 93/1 1161; and HoUinger et al., Proc. Natl. Acad. Sci. USA, 90:6444-6448 (1993). 

An "isolated" antibody is one which has been identified and separated and/or recovered from a 
component of its natural environment. Contaminant components of its natural environment are materials 
which would interfere with diagnostic or tlierapeutic uses for the antibody, and may include enzymes, 
hormones, and other proteinaceous or nonproteinaceous solutes. In preferred embodiments, the antibody 

25 will be purified (1) to greater than 95% by weight of antibody as determined by the Lowry method, and most 
preferably more than 99% by weight, (2) to a degree sulBBLcient to obtain at least 15 residues of N-terminal or 
internal amino acid sequence by use of a spinning cup sequenator, or (3) to homogeneity by SDS-PAGE 
imder reducing or nonreducing conditions using Coomassie blue or, preferably, silver stain. Isolated 
antibody includes the antibody in situ within recombinant cells since at least one component of the 

30 antibody's natural environment will not be present. Ordinarily, however, isolated antibody wiU be prepared 
by at least one purification step. 

An antibody that "specifically binds to" or is "specific for" a particular polypeptide or an epitope on 
a particular polypeptide is one tliat binds to that particular polypeptide or epitope on a particular polypeptide 
without substantially binding to any other polypeptide or polypeptide epitope. 

35 The word "label" when used herein refers to a detectable compound or composition which is 

conjugated directly or indirectly to the antibody so as to generate a "labeled" antibody. The label may be 
detectable by itself (e.g. radioisotope labels or fluorescent labels) or, in the case of an enzymatic label, may 
catalyze chemical alteration of a substrate compound or composition which is detectable. 

By "solid phase" is meant a non-aqueous matrix to which the antibody of the present invention can 

40 adhere. Examples of solid phases encompassed herein include those formed partially or entirely of glass 
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(e.g., controlled pore glass), polysaccharides (e.g., agarose), polyacrylamides, polystyrene, poljrviiiyl alcohol 
and silicones. In certain embodiments, depending on the context, the solid phase can comprise the well of an 
assay plate; in others it is a purification column (e.g., an affinity chromatography column). This term also 
includes a discontinuous solid phase of discrete particles, such as those described in U.S. Patent No. 
5 4,275,149. 

A "liposome" is a small vesicle composed of various types of lipids, phospholipids and/or 
surfactant which is useful for delivery of a drug (such as a PRO polypeptide or antibody thereto) to a 
mammal. The components of the liposome are commonly arranged ia a bilayer formation, similar to the 
lipid arrangement of biological membranes. 

10 A "small molecule" is defined herein to have a molecular weight below about 500 Daltons. 

The term "immune related disease" means a disease in which a component of the immune system of 
a manmial causes, mediates or otherwise contributes to a morbidity in the mammal. Also included are 
diseases in which stimulation or intervention of the immune response has an ameliorative effect on 
progression of the disease. Included within this term are immune-mediated inflammatory diseases, non- 
15 immune-mediated inflammatory diseases, infectious diseases, immunodeficiency diseases, neoplasia, etc. 

The term "monocyte/macrophage mediated disease" means a disease in which 
monocytes/macrophages directly or indirectly mediate or otlierwise contribute to a morbidity in a mammal. 
The monoc3^e/macrophage mediated disease may be associated with cell mediated effects, lymphokine 
mediated effects, e/c, and even effects associated with other immune cells if the cells are stimulated, for 

20 example, by the lymphokines secreted by monocytes/macrophages. 

Examples of immune-related and inflammatory diseases, some of which are immune mediated, 
which can be treated according to the invention include systemic lupus erytliematosis, rheumatoid arthritis, 
juvenile chronic arthritis, spondyloarthropathies, systemic sclerosis (scleroderma), idiopathic inflammatory 
myopathies (dermatomyositis, polymyositis), Sjogren's syndrome, systemic vasculitis, sarcoidosis, 

25 autoimmune hemolytic anemia (immune pancytopenia, paroxysmal nocturnal hemoglobinuria), autoimmune 
thromboc3^openia (idiopathic thrombocytopenic purpura, immune-mediated thrombocytopenia), thyroiditis 
(Grave's disease, Hashimoto's thyroiditis, juvenile lymphoc3^c thyroiditis, atrophic thyroiditis), diabetes 
mellitus, immime-mediated renal disease (glomerulonephritis, tubulointerstitial nephritis), demyelinating 
diseases of the central and peripheral nervous systems such as multiple sclerosis, idiopathic demyelinating 

30 polyneuropathy or Guillain-Barre syndrome, and chronic inflammatory demyelinating polyneuropathy, 
hepatobiliary diseases such as infectious hepatitis (hepatitis A, B, C, D, E and other non-hepatotropic 
viruses), autoimmune chronic active hepatitis, primary biliary cirrhosis, granulomatous hepatitis, and 
sclerosing cholangitis, inflammatory bowel disease (ulcerative colitis: Crohn's disease), gluten- sensitive 
enteropathy, and Whipple's disease, autoimmune or immune-mediated skin diseases including bullous skin 

35 diseases, erythema multiforme and contact dermatitis, psoriasis, allergic diseases such as asthma, allergic 
rhinitis, atopic dermatitis, food hypersensitivity and urticaria, immunologic diseases of the limg such as 
eosinophilic pneumonias, idiopathic pidmonary fibrosis and hypersensitivity pneumonitis, transplantation 
associated diseases including graft rejection and graft -versus-host-disease. Infectious diseases including 
viral diseases such as AIDS (HTV infection), hepatitis A, B, C, D, and E, herpes, etc,^ bacterial infections, 

40 fungal infections, protozoal infections and parasitic infections. 
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The term "effective amount" is a concentration or amount of a PRO polypeptide and/or 
agonist/antagonist which results in achieving a particular stated purpose. An "effective amount" of a PRO 
polypeptide or agonist or antagonist thereof may be determined empirically. Furthermore, a "therapeutically 
effective amount" is a concentration or amount of a PRO polypeptide and/or agonist/antagonist which is 
5 effective for achieving a stated therapeutic effect. This amount may also be determined empirically. 

The term "cytotoxic agent" as used herein refers to a substance that inhibits or prevents the function 
of cells and/or causes destruction of cells. The term is intended to include radioactive isotopes {e.g,, I^^^ 
ji25^ Re^^^), chemotherapeutic agents, and toxins such as enzymatically active toxins of bacterial, 

fungal, plant or animal origin, or fragments thereof. 

10 A "chemotherapeutic agent" is a chemical compoimd useful in the treatment of cancer. Examples 

of chemotherapeutic agents include adriamycin, doxorubicin, epirubicin, 5-fluorouracil, cytosine arabinoside 
("Ara-C"), cyclophosphamide, thiotepa, busulfan, cytoxin, taxoids, e.g,, paclitaxel (Taxol, Bristol-Myers 
Squibb Oncology, Princeton, NJ), and doxetaxel (Taxotere, Rhone-Poulenc Rorer, Antony, France), 
toxotere, methotrexate, cisplatin, melphalan, vinblastine, bleomycin, etoposide, ifosfamide, mitomycin C, 

15 mitoxantrone, vincristine, vinorelbine, carboplatin, teniposide, daunomycin, carminomycin, aminopterin, 
dactinomycin, mitomycins, esperamicins (see U.S. Pat. No. 4,675,187), melphalan and other related nitrogen 
mustards. Also included in this definition are hormonal agents that act to regulate or inhibit hormone action 
on tumors such as tamoxifen and onapristone. 

A "growth inhibitory agent" when used herein refers to a compound or composition which inhibits 

20 growth of a cell, especially cancer cell overexpressing any of the genes identified herein, either in vitro or in 
vivo. Thus, the growth inhibitory agent is one which significantiy reduces the percentage of cells 
overexpressing such genes in S phase. Examples of growth inhibitory agents include agents that block cell 
cycle progression (at a place other than S phase), such as agents that induce Gl arrest and M-phase arrest. 
Classical M-phase blockers include the vincas (vincristine and vinblastine), taxol, and topo II inhibitors such 

25 as doxorubicin, epirubicin, daunorubicin, etoposide, and bleomycin. Those agents that arrest Gl also spill 
over into S-phase arrest, for example, DNA alkylating agents such as tamoxifen, prednisone, dacarbazine, 
mechlorethamine, cisplatin, methotrexate, 5-fluorouracil, and ara-C. Fuither information can be found in 
The Molecular Basis of Cancer, Mendelsohn and Israel, eds.. Chapter 1, entitled "Cell cycle regulation, 
oncogens, and antineoplastic drugs" by Murakami et al, (WB Saunders: Philadelphia, 1995), especially p. 

30 13. 

The term "cytokine" is a generic term for proteins released by one cell population which act on 
anotiier cell as intercellular mediators. Examples of such cytokines are lymphokines, monokines, and 
traditional polypeptide hormones. Included among the cytokines are growth hormone such as human growth 
hormone, N-methionyl human growth hormone, and bovine growth hormone; parathyroid hormone; 

35 thyroxine; insulin; proinsulin; relaxin; prorelaxin; glycoprotein hormones such as follicle stimulating 
hormone (FSH), thyroid stimulating hormone (TSH), and luteinizing hormone (LH); hepatic growth factor; 
fibroblast growth factor; prolactin; placental lactogen; tumor necrosis fector-a and -P; mullerian-inhibiting 
substance; mouse gonadotropin-associated peptide; inhibin; activin; vascular endothelial growth factor; 
integrin; thrombopoietin (TPO); nerve growth factors such as NGF-p; platelet-growth factor; transforming 

40 growth factors (TGFs) such as TGF-a and TGF-P; insulin-like growth fector-I and -II; erythropoietm (EPO); 
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osteoinductive factors; interferons such as interferon-a, -P, and -y; colony stimulating factors (CSFs) such as 
macrophage-CSF (M-CSF); granulocyte-macrophage-CSF (GM-CSF); and granulocyte-CSF (G-CSF); 
interleukins (ILs) such as IL-1, IL-la, IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-11, IL-12; a tumor 
necrosis factor such as TNF-a or TNF-P; and other polypeptide factors including LIF and kit ligand (KL). 
5 As used herein, the tenn cytokine includes proteins from natural sources or from recombiaant cell culture 
and biologically active equivalents of the native sequence cytokines. 

As used herein, the term "imoiunoadhesin" designates antibody-like molecules which combine the 
binding specificity of a heterologous protein (an "adhesin") with the effector functions of immunoglobulin 
constant domains. Structurally, the immunoadhesins comprise a frision of an amino acid sequence with the 

10 desired binding specificity which is other than the antigen recognition and binding site of an antibody is 
"heterologous"), and an immunoglobulin constant domain sequence. The adhesin part of an immunoadhesin 
molecule typically is a contiguous amino acid sequence comprising at least the binding site of a rec^tor or a 
ligand- The inmiunoglobulin constant domain sequence in the immunoadhesin may be obtained from any 
immunoglobulin, such as IgG-1, IgG-2, IgG-3, or IgG-4 subtypes, IgA (including IgA-1 and IgA-2), IgE, 

15 IgDorlgM. 
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Table 1 

* 

* C-C increased from 12 to 15 

* Z is average of EQ 

* B is average of ND 

* match with stop is _M; stop-stop = 0; J (joker) match = 0 
*/ 

#deflne _M -8 /* value of a match with a stop */ 



int 

/* a' 

/* A */ 
/*B*/ 
/*C*/ 
/* D */ 
/* E */ 
/* F */ 
/* G */ 
/* H */ 
/*!*/ 
/* J */ 
/*K*/ 
/*L*/ 
/*M*/ 
/*N*/ 
/* O */ 

p 

/* Q */ 
/*R*/ 
/*S */ 
/* "p */ 

/*U*/ 

Y *y 
/* W*/ 

/*x */ 
/* Y */ 
/*Z*/ 

}; 



_day[26][26] = { 

BCDEFGHIJKLMNOPQRSTUVWXYZ*/ 
2, 0,-2, 0, 0,-4, 1,-1,-1, 0,-1,-2,-1, 0,_M, 1, 0,-2, 1, 1, 0, 0,-6, 0,-3, 0}, 
0, 3,-4, 3, 2,-5, 0, 1,-2, 0, 0,-3,-2, 2,JVI,-1, 1, 0, 0, 0, 0,-2,-5, 0,-3, 1}, 
-2,-4,15,-5,-5,-4,-3,-3,-2, 0,-5,-6,-5,-4,_M,-3,-5,-4, 0,-2, 0,-2,-8, 0, 0,-5}, 
0, 3,-5, 4, 3,-6, 1, 1,-2, 0, 0,-4,-3, 2,_M,-1, 2,-1, 0, 0, 0,-2,-7, 0,-4, 2}, 

0, 2,-5, 3, 4,-5, 0, 1,-2, 0, 0,-3,-2, 1,_M,-1, 2,-1, 0, 0, 0,-2,-7, 0,-4, 3}, 
[-4,-5,-4,-6,-5, 9,-5,-2, 1, 0,-5, 2, 0,-4,_M,-5,-5,-4,-3,-3, 0,-1, 0, 0, 7,-5}, 

1, 0,-3, 1, 0,-5, 5,-2,-3, 0,-2,-4,-3, 0,_M,-l,-l,-3, 1, 0, 0,-1,-7, 0,-5, 0}, 
[-1, 1,-3, 1, 1,-2,-2, 6,-2, 0, 0,-2,-2, 2,_M, 0, 3, 2,-1,-1, 0,-2,-3, 0, 0, 2}, 
-1,-2,-2,-2,-2, 1,-3,-2, 5, 0,-2, 2, 2,-2,__M,-2,-2,-2,-l, 0, 0, 4,-5, 0,-1,-2}, 
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,_M, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0}, 

[-1, 0,-5, 0, 0,-5,-2, 0,-2, 0, 5,-3, 0, 1,^M,-1, 1, 3, 0, 0, 0,-2,-3, 0,-4, 0}, 
-2,-3,-6,-4,-3, 2,-4,-2, 2, 0,-3, 6, 4,-3,_M,-3,-2,-3,-3,-l, 0, 2,-2, 0,-1,-2}, 
[-1,-2,-5,-3,-2, 0,-3,-2, 2, 0, 0, 4, 6,-2,_M,-2,-l, 0,-2,-1, 0, 2,-4, 0,-2,-1}, 

0, 2,-4, 2, 1,-4, 0, 2,-2, 0, 1,-3,-2, 2,_M,-1, 1, 0, 1, 0, 0,-2,-4, 0,-2, 1}, 
_M,_M,_M,_,M,_M,_M,_M,_M,_M,_M,_M,_M,_M,_M, 

M,_M,_M,__M,_M,_M,__M,_M,_M}, 

1, -1,-3,-1,-1,-5,-1, 0,-2, 0,-l,-3,-2,-l,_M, 6, 0, 0, 1, 0, 0,-1,-6, 0,-5, 0}, 

0, 1,-5, 2, 2,-5,-1, 3,-2, 0, 1,-2,-1, 1,_M, 0, 4, 1,-1,-1, 0,-2,-5, 0,-4, 3}, 
-2, 0,-4,-1,-1,-4.-3, 2,-2, 0, 3,-3, 0, 0,_M, 0, 1, 6, 0,-1, 0,-2, 2, 0,-4, 0}, 

1, 0, 0, 0, 0,-3, 1,-1,-1, 0, 0,-3,-2, 1,_M, 1,-1, 0, 2, 1, 0,-1,-2, 0,-3, 0}, 
1, 0,-2, 0, 0,-3, 0,-1, 0, 0, 0,-1,-1, 0,_M, 0,-1,-1, 1, 3, 0, 0,-5, 0,-3, 0}, 
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,__M, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0}, 
0,-2,-2,-2,-2,-1,-1,-2, 4, 0,-2, 2, 2,-2,_M,- 1,-2,-2,-1, 0, 0, 4,-6, 0,-2,-2}, 
-6,-5,-8,-7,-7, 0,-7,-3,-5, 0,-3,-2,-4,-4,_M,-6,-5, 2,-2,-5, 0,-6,17, 0, 0,-6}, 
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,_M, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0}, 
-3,-3, 0,-4,-4, 7,-5, 0,-1, 0,-4,-l,-2,-2,_M,-5,-4,-4,-3,-3, 0,-2, 0, 0,10,-4}, 
0, 1,-5, 2, 3,-5, 0, 2,-2, 0, 0,-2,-1, 1,_M, 0, 3, 0, 0, 0, 0,-2,-6, 0,-4, 4} 
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Table 1 (cont^) 

/* 
*/ 

#include <stdio.h> 
#include <ctype.h> 



^define 


MAXJMP 


16 


/* max jumps in a diag */ 


^define 


MAXGAP 


24 


/* don't continue to penalize gaps larger than this */ 


#define 


IMPS 


1024 


/* max jmps in an path */ 


#define 


MX 


4 


/* save if there's at least MX-1 bases since last jmp */ 


#define 


DMAT 


3 


/* value of matching bases */ 


#define 


DMIS 


0 


/* penalty for mismatched bases */ 


^define 


DINSO 


8 


/* penalty for a gap */ 


#define 


DINSl 


1 


/* penalty per base */ 


^define 


PINSO 


8 


/* penalty for a gap */ 


^define 


PINSl 


4 


/* penalty per residue */ 



stract jmp { 

short n[MAXJMP]; /* size of jmp (neg for dely) */ 

unsigned short x[MAXJMP]; /* base no. of jmp in seq x */ 

}; /* limits seq to 2'^ 16 -1 */ 



struct diag { 








int 


score; 


/* score at last jmp */ 




long 


offset; 


/* offset of prev block */ 




short 


ijmp; 


/* current jmp index */ 


}; 


struct jmp jp; 


/* list of jmps */ 


struct path { 








int 


spc; /* 


' number of leading spaces */ 




short 


n[JMPS];/* size of jmp (gap) */ 


}; 


int 


x[JMPS];/* loc of jmp (last elem before gap) */ 


char 




*ofile; 


/* output file name 


char 




*namex[2]; 


/* seq names: getseqs() */ 


char 




*prog; 


/* prog name for err msgs */ 


char 




*seqx[2]; 


/* seqs: getseqs() */ 


int 




dmax; 


/* best diag: nw() */ 


int 




dmaxO; 


/* final diag */ 


int 




dna; 


/* set if dna: main() */ 


mt 




endgaps; 


/* set if penalizing end gaps */ 


mt 




gapx, gapy; 


/* total gaps in seqs */ 


mt 




lenO, lenl; 


/* seq lens */ 


mt 




ngapx, ngapy; 


/* total size of gaps */ 


int 




smax; 


/* max score: nwQ */ 


int 




*xbm; 


/* bitmap for matching */ 


long 




offset; 


/* current offset in jmp file */ 


struct 


diag 


*dx; 


/* holds diagonals */ 


struct 


path 


pp[2]; 


/* holds path for seqs */ 


char 




*calloc(), *malloc(). 


*indexO, *strcpy(); 


char 




*getseq(), *g_calloc(); 
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Table 1 (cont^) 

/* Needleman-Wunsch alignment program 

* 

* usage: progs filel file2 

* where filel and file2 are two dna or two protein sequences. 

* The sequences can be in upper- or lower-case an may contain ambiguity 

* Any lines beginning with ' > ' or ' < ' are ignored 

* Max file length is 65535 (limited by unsigned short x in the jmp struct) 

* A sequence with 1/3 or more of its elements ACGTU is assumed to be DNA 

* Output is in the file "align.out" 

* The program may create a trap file in /tmp to hold info about traceback. 

* Original version developed under BSD 4.3 on a vax 8650 

*/ 

#iDclude "nw.h" 
^include "day.h" 

static _dbval[26] = { 

1 , 14,2, 13,0,0,4, 1 1 ,0,0, 12,0,3, 15,0,0,0,5,6,8,8,7,9,0, 10,0 



static _pbval[26] = { 

1, 2|(1< <CD*-'A'))|(1< <CN'-'A')), 4, 8, 16, 32, 64, 
128, 256, OxFFFFFFF, 1< < 10, 1< < 11, 1< < 12, 1< < 13. 1< < 14. 
1<<15, 1<<16, 1<<17, 1<<18, 1<<19, 1<<20, 1< <21, 1< <22, 
1< <23, 1< <24, 1< <25 1 ( 1< < (»E»-'A*)) | (1< < CQ'-' A')) 

}; 

main(ac, av) 

main 

int ac; 
char *av[ ]; 

{ 

prog = av[0]; 
if (ac != 3) { 

fprintf(stderr, "usage: %s filel file2\n", prog); 

^rintf(stderr, "where filel and file2 are two dna or two protein sequences.\n"); 
5)rintf(stderr,"The sequences can be in upper- or lower-case\n"); 
5)rintf(stderr,"Any lines beginning with ';' or ' < * are ignored\n"); 
§)rintf(stderr, "Output is in die file \ "align. out\"\n"); 
exit(l); 

} 

namex[0] = av[l]; 

namex[l] = av[2]; 

seqx[0] = getseq(namex[0], &len0); 

seqx[l] == getseq(namex[l], &lenl); 

xbm = (dna)? jlbval : j)bval; 

endgaps = 0; /* 1 to penalize endgaps */ 

ofile = "align.out"; /* output file */ 

nw(); /* fill in the matrix, get the possible jmps */ 

readjmpsO; /* get the actual jmps */ 

printO; /* print stats, alignment */ 

cleanup(0); /* unlink any tmp files */ 
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Table 1 (conV) 

/* do the alignment, return best score: mainO 

* dna: values in Fitch and Smith, PNAS, 80, 1382-1386, 1983 

* pro: PAM 250 values 

* When scores are equal, we prefer mismatches to any gap, prefer 

* a new gap to extending an ongoing gap, and prefer a gap in seqx 

* to a gap in seq y. 



*/ 








nw() 








{ 


nw 








char 


*px, *py; 


/* seqs and ptrs */ 




int 


*ndely, *dely; 


/* keep track of dely */ 
/* keep track of delx */ 




int 


ndelx, delx; 




int 


*tmp; 


/* for swapping rowO, rowl */ 




int 


mis; 


/* score for each type */ 




int 


insO, insl; 


/* insertion penalties */ 




register 


id; 


/* diagonal index */ 




register 


ij; 


/* jmp index */ 




roister 


*colO, *coll; 


/* score for curr, last row */ 




register 


XX, yy; 


/* index into seqs */ 



dx = (struct diag *)g_calloc("to get diags", len04-lenl + l, sizeof(struct diag)); 

ndely = (int *)g_calloc("to getndely", lenl + 1, sizeof(int)); 
dely — (int *)g_calloc("to get dely", lenl + 1, sizeof(int)); 
colO = (int *)g_calloc("to get colO", lenl + 1, sizeof(int)); 
coll = (int *)g_calloc("to get coil", lenl + l, sizeof(int)); 
insO = (dna)? DINSO : PINSO; 
insl = (dna)? DINSl : PINSl; 

smax = -10000; 
if (endgaps) { 

for (colO[0] = dely[0] = -insO, yy = 1; yy < = lenl; yy++) { 
colO[yy] = dely[yy] = colO[yy-l] - insl; 
ndelyiyy] = yy; 

} 

colO[0] = 0; /* Waterman Bull Math Biol 84 */ 

} 

else 

for (yy " 1; yy < = lenl; yy-l-+) 
dely[yy] — -insO; 

/* fill in match matrix 
*/ 

for (px = seqx[01, xx = 1; xx < = lenO; px++, xx+ +) { 
/* initialize first entry in col 

*/ 

if (endgaps) { 

if(xx== 1) 

coll[0] = delx ^ -(insO+insl); 

else 

coll[0] = delx = colO[0] - insl; 
ndelx = xx; 

} 

else { 

coll[0] = 0; 
delx — -insO; 
ndelx == 0; 

} 
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Table 1 (cont^) 

•..nw 

for (py = seqx[l], yy = 1; yy <= lenl; pyH-4-, yy++) { 
mis — colO[yy-l]; 
if (dna) 

mis (xbm[*px-'A1&xbm[*py-'A'])? DMAT : DMIS; 

else 

mis _day[*px-'A*l[*py-'A']; 

/* update penalty for del in x seq; 

* favor new del over ongong del 

* ignore MAXGAP if weighting endgaps 
*/ 

if (endgaps | | ndely[yy] < MAXGAP) { 

if (colO[yy] - insO > = dely[yy]) { 

dely[yy] = colO[yy] - (insO+insl); 
ndelyCyy] = 1; 

} else { 

dely[yy] -= insl; 
ndely[yy] + + ; 

} 

} else { 

if (colO[yy] - (insO+insI) > = dely[yy]) { 

dely[yy] = colO[yy] - (insO+insI); 
ndely[yy] = 1; 

} else 

ndely[yy] + + ; 

} 

/* update penalty for del in y seq; 

* favor new del over ongong del 

*/ 

if (endgaps | | ndelx < MAXGAP) { 

if (coll[yy-l] - insO > = delx) { 

delx = coll[yy-l] - (insO+insI); 
ndelx = 1; 

} else { 

delx insl; 
ndelx + +; 

> 

} else { 

if (coll[yy-l] - (insO+insI) > = delx) { 
delx = coll[yy-l] - (insO+insI); 
ndelx = 1; 

} else 

ndelx + +; 

> 

/* pick the maximum score; we're favoring 

* mis over any del and delx over dely 

*/ 
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Table 1 (cont^) 

••.nw 

id = XX - yy + ienl - 1 ; 
if (mis > = delx && mis > = dely[yy]) 
5 coll[yy] = mis; 

dse if (delx > = dely[yy]) { 

colllyy] == delx; 

ij — dx[id].ijmp; 

if (dx[id].jp.n[0] && (!dna | | (ndelx > = MAXJMP 
10 &&XX > dx[id].jp.x[ij]+MX) 1 1 mis > dx[id]. score +DINSO)) { 

dx[id].ijmp+ + ; 
if (+ +ij > MAXJMP) { 
writejmps(id); 
ij = dx[id].ijmp = 0; 

15 dxpd]. offset = offset; 

offset += sizeof (struct jmp) + sizeof (offset); 

} 

> 

dx[id] jp.n[ij] = ndelx; 
20 dx[id].jp.x[ij] = xx; 

dx[id]. score = delx; 

} 

else { 

colllyy] = dely[yy]; 
25 ij = dx[id].ijmp; 

if (dx[id].jp.n[0] && (!dna 1 1 (ndely[yy] > = MAXJMP 

&&XX > dx[id]Jp.x[ij]+MX) 1 1 mis > dx[id]. score +DINSO)) { 
dx[id].ijmp + +; 
if (+ +ij > = MAXJMP) { 
30 writejmps(id); 

ij = dx[id].ijmp — 0; 
dx[id]. offset = offset; 

offset += sizeof(struct jmp) + si2:eof(offset); 

} 

35 } 

dx[id].jp.n[ij] = -ndely[yy]; 
dx[id].jp.x[ij] = xx; 
dx[id]. score = dely[yy]; 

> 

40 if (xx == lenO&&yy < Ienl) { 

/* last col 
*/ 

if (endgaps) 

coll[yy] -= insO+insl*(lenl-yy); 
45 if (coll [yy] > smax) { 

smax = colllyy]; 
dmax — id; 

} 

} 

50 } 

if (endgaps && xx < lenO) 

coll[yy-l] -= insO+insl*(lenO-xx); 
if (coil[yy~l] > smax) { 

smax = coll[yy-l]; 
55 dmax = id; 

} 

tmp == colO; colO = coll; coll = tmp; 

> 

(void) free((char *)ndely); 
60 (void) free((char *)dely); 

(void) free((char *)colO); 

(void) free((chiar *)coll); } 
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Table 1 (cont^) 

* printO ~ only routine visible outside this module 
* 

* static: 

* getmatO — trace back best path, count matches: printO 

* pr alignO — print alignment of described in array p[ ]: print() 

* dumpblockO — dump a block of lines with numbers, stars: pr_align() 

* numsQ — put out a number line: dumpblock() 

* putlineO - put out a line (name, [num], seq, [num]): dumpblockQ 

* starsO - -put a line of stars: dumpblockQ 

* stripnameO — strip any path and prefix from a seqname 
*/ 

^include "nw.h" 
#defme SPC 3 

Mefine P_LINE 256 /* maximum output line */ 

^define P_SPC 3 /* space between name or num and seq */ 

extern day [26] [26]; 

int olen; /* set output line length */ 

FILE *fx; /* output file */ 

printO 



{ 



print 

int Ix, ly, firstgap, lastgap; /* overlap */ 

if ((fx = fopen(ofile, "w")) == 0) { 

^rintf(stderr,"%s: can't write %s\n", prog, ofile); 
cleanup(l); 

} 

:^rintf(fx, "< first sequence: %s (length = %d)\n", namex[0], lenO); 
^rintfCfx:, "< second sequence: %s (length = %d)\n", namex[l], lenl); 
olen = 60; 
Ix = lenO; 
ly == lenl; 

firstgap = lastgap — 0; 

if (dmax < lenl - 1) { /* leading gap in x */ 
pp[0].spc = firstgap = lenl - dmax - 1; 
ly-= pp[0].spc; 

} 

else if (dmax > lenl - 1) { /* leading gap in y */ 
pp[l],spc = firstgap = dmax - (lenl - 1); 
lx-= pp[l].spc; 

} 

if (dmaxO < lenO - 1) { /* trailing gap in x */ 
lastgap = lenO - dmaxO -1; 
Ix -= lastgap; 

} 

else if (dmaxO > lenO - 1) { /* trailing gap in y */ 
lastgap == dmaxO - (lenO - 1); 
ly lastgap; 

} 

getmat(lx, ly, firstgap, lastgap); 
pr^alignO; 
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Table 1 (cont^) 

/* 

* trace back the best path, count matches 

*/ 

static 

getmat(lx, ly, firstgap, lastgap) getmat 
int Ix, ly; /* "core" (minus endgaps) */ 

int firstgap, lastgap; /* leading trailing overlap */ 

{ 

int nm, iO, il, sizO, sizl; 

char outx[32]; 
double pet; 
register nO, nl; 

register char *pO, *pl; 

/* get total matches, score 
*/ 

iO = il = sizO = sizl = 0; 
pO = seqx[0] + pp[l].spc; 
pi = seqx[l] + pp[0].spc; 
nO = pp[l].spc + 1; 
nl = pp[0].spc + 1; 



nm = 0; 

while ( *pO && *pl ) { 
if (sizO){ 

P14- + ; 
nl + + ; 
sizO--; 

> 

else if (sizl) { 

pO+ + ; 
nO++; 
sizl—; 

} 

else { 

if (xbm[*pO-'A']&xbm[*pl-'A']) 

nm+ + ; 
if (nO + + = = pp[0] .x[iO]) 

sizO = pp[0].n[iO++]; 
if(nl + + ==pp[l].x[il]) 

sizl =pp[l].n[iH--l-]; 

pO+ + ; 

pi ++; 

} 

} 

/* pet homology: 

* if penalizing endgaps, base is the shorter seq 

* else, knock off overhangs and take shorter core 

*/ 

if (endgaps) 

Ix = (lenO < lenl)? lenO : lenl; 

else 

Ix = (Ix < ly)? Ix : ly; 
pet = 100.*(double)mn/(double)bc: 
^rintf(fx, "\n"); 

fjprmtf(fx, " < %d match%s in an overlap of %d: %.2f percent similarity\n", 
nm, (nm 1)? : "es", Ix, pet); 
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Table 1 (cont^ 

^rintf(fx, " <gaps in first sequence: %d", gapx); 

...getmat 

if (gapx) { 

(void) sprintf(outx, " (%d %s%s)", 

ngapx, (dna)? "base": "residue", (ngapx == 1)? "":"s"); 
Q)rintf(fx,"%s", outx); 

j5)rintf(fx, gaps in second sequence: %d", gapy); 
if(gapy){ 

(void) sprintf(outx, " (%d %s%s)", 

ngapy, (dna)? "base": "residue", (ngapy == 1)? "":"s"); 

Q)rintf(fx,"%s", outx); 

} 

if (dna) 

flprintf(fe, 

"\n< score: %d (match = %d, mismatch = %d, gap penalty = %d + %d per base)\n", 
smax, DMAT, DMIS, DINSO, DINSl); 

else 

l5printf(fx, 

"\n< score: %d (Dayhoff PAM 250 matrix, gap penalty = %d + %d per residue)\n", 
smax, PINSO, PINSl); 
if (endgaps) 

:^rintf(fx, 

"< endgaps penalized, left endgap: %d %s%s, right endgap: %d %s%s\n", 
firstgap, (dna)? "base" : "residue", (firstgap == 1)? "" : "s", 
lastgap, (dna)? "base" : "residue", (lastgap == 1)? "" : "s"); 



fjprintf(fx, "< endgaps not penalized\n"); 

} 

static nm; /* matches in core — for checking */ 

static Imax; /* lengths of stripped file names */ 

static ij[2] ; /* jmp index for a path */ 

static nc[2]; /* number at start of current line */ 

static ni[2]; /* current elem number — for gapping */ 

static siz[2]; 

static char '*'ps[2]; /* ptr to current element */ 

static char '^po[2]; /* ptr to next output char slot */ 

static char out[2][P_LINE]; /* output liue */ 

static char star[P_LINE]; /* set by stars() */ 

/* 

* print alignment of described in struct path pp[ ] 

*/ 

static 
pr_align() 

pr align 

{ 

int nn; /* char count */ 

int more; 

i; 



for (i = 0, Imax = 0; i < 2; i++) { 
nn = stripname(namex[i]); 
If (nn > Imax) 

Imax = nn; 

nc[i] = 1; 
ni[i] = 1; 
siz[i] = ij[i] = 0; 
ps[i] = seqx[i]; 

po[i] = out[i]; } 
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Table 1 (cont^ 

for (nn = nm = 0, more = 1; more; ) { 

...pr_align 

for (i = more = 0; i < 2; i+ +) { 

/* 

* do we have more of this sequence? 

*/ 

if(!*psli]) 

continue; 

more+ + ; 



if (pp[i]-spc) { /* leading space */ 
*po[i] + + = • •; 
15 pp[i],spc-; 

} 

if (siz[i]) { /* in a gap */ 
*po[i] + + = '-'; 
siz[i]~; 

20 } 

else { /* we're putting a seq element 

*/ 

*po[i] = *ps[i]; 
if (islower(*ps[i])) 

25 *ps[i] = toupper(*ps[i]); 

po[i]++; 
ps[i]++; 

/* 

30 * are we at next gap for this seq? 

*/ 

if(ni[i] -=pp[i].x[ij[il]){ 
/* 

* we need to merge all gaps 
35 * at this location 

*/ 

siz[i] pp[i].n[ij[i] + +]; 
while (ni[i] ==pp[i].x[ij[i]]) 

siz[i] += pp[i].n[ij[i] + +]; 

40 } 

ni[i] + + ; 

} 

} 

if (+ +nn == = olen 1 1 !more && nn) { 
45 dumpblockO; 

for(i = 0; i < 2; i++) 
po[i] = out[i]; 

im = 0; 

} 

50 1 



/* 

* dump a block of lines, including numbers, stars: pr_alignO 
55 */ 

static 

dumpblockO 

dumpblock 

{ 

60 register i; 

for(i = 0; i < 2; i++) 
*po[i]- = '\0'; 
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Table 1 (cont^) 

...dumpblock 



> 



(void) putc('\n', fx); 
for(i = 0; i < 2; i++) { 

if (*out[i] && (*out[i] != ' ' II *(po[i]) !=*')){ 
if(i-=0) 

nums(i); 
if(i==0&& *out[l]) 
starsO; 

putline(i); 

if (i = = 0&& *out[l]) 

^rmtf(fx, star); 
if(i== 1) 

nums(i); 

} 

} 



/* 

* put out a number line: dumpblockQ 

*/ 

static 

nums(ix) numS 
int ix; /* index in out[ ] holding seq line */ 

{ 

char nline[P_LINE]; 

register i, j; 

register char *pn, *px, *py; 

for (pn = nline, i = 0; i < Imax+P SPC; i+ + , pn++) 

*pn = * 

for (i = nc[ix], py = out[ix]; *py; py+ + , pn++) { 

if(*py . . II H:py==:=3: -) 

*pn = ' '; 

else { 

if (i%10 0 I I (i 1 && nc[ix] != 1)) { 
j = (i < 0)? -i : i; 
for (px = pn; j; j /= 10, px— ) 
*px = j%10 + '0'; 

if (i < 0) 

*px = 



} 

else 
i++; 



*pn = • •; 



} 



} 



nc[ix] = i; 

for (pn = nline; *pn; pn+ +) 
(void) putc(*pn, fie); 
(void) putc('\n', fx); 



/* 

* put out a line (name, [num], seq, [num]): dumpblock() 
static 

putline(ix) putline 
int ix; { 
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Table 1 (cont^) 



int i; 
5 register char *px; 

for (px = namex[ix], i = 0; *px && *px != px+ +, i+ +) 

(void) putc(*px, fx); 
for (; i < lmax+P_SPC; i++) 
10 (void)putcC fx); 

/* these count from 1: 

* ni[ ] is current element (from 1) 

* nc[ ] is number at start of current line 
15 */ 

for (px = out[ix]; *px; px+ +) 

(void) putc(*px&0x7F, fx); 
(void) putc('\n', fx); 

} 

20 

/* 

* put a line of stars (seqs always in out[0], out[l]): dumpblock() 
*/ 



...putline 



stars 



25 static 

starsO 

{ 

int i; 
30 roister char *pO, *pl, cx, *px; 

if (!*out[0] I I (*out[0] *(po[0]) = = • ') 1 1 

!*out[l] 1 1 (*out[l] *(po[l]) ===•')) 

return; 
35 px = star; 

for (i = Imax+P SPC; i; i-) 
*px4-+ = ' '; 

for (pO = out[0], pi = out[l]; *pO && *pl; pO+ + , pH-+) { 
40 if (isalpha(*pO) && isalpha(*pl)) { 

if (xbm[*pO-'A']&xbm[*pl-'A']) { 
cx = '*'; 
nm-f + ; 

45 } 

elseif (!dna&&_day[*pO-'A'][*pl-*A'] > 0) 
cx= •/; 

else 

cx = * *; 

50 } 

else 

cx = ' '; 
*px+ + = cx; 

} 

55 *px++ = '\n'; 

*px = '\0*; 
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Table 1 (cont^) 

/* 

* strip path or prefix from pn, return len: pr^alignQ 

*/ 

5 static 

stripname(pn) 

stripname 

char *pn; /* file name (may be path) */ 

{ 

10 register char *px, *py; 

py-0; 

for (px = pn; *px; px+ +) 
if (*px ==•/•) 
15 py = px + 1; 

if(py) 

(void) strcpy(pn, py); 
return(strlen(pn)); 

20 } 
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Table 1 (cont^) 

/* 

* cleanupO — cleanup any tmp file 

* getseqO — read in seq, set dna, len, maxlen 

* g callocO — callocO with error checldn 

* readjmpsO — get the good jmps, from tmp file if necessary 

* writejmpsO — write a filled array of jmps to a tmp file: nw() 
*/ 

^include "nw.h" 
#include <sys/file.h> 



char *jname = "/tmp/homgXXXXXX"; 
FILE *g; 

int cleanupO; 
long IseekO; 

/* 

* remove any tmp file if we blow 

*/ 

cleanup(i) 



{ 



int 

if (9) 
exit(i); 



i; 



(void) iinlink(jname); 



/* tmp file for jmps */ 
/* cleanup tmp file */ 



cleanup 



/* 

* read, return ptr to seq, set dna, len, maxlen 
skip lines starting with *; ' < \ or * > * 

* seq in upper or lower case 
*/ 

char * 
getseq(file, len) 
char 
mt 



*file; 
*len; 



char 

register char 
mt 

FILE 



/* file name */ 
/* seq len */ 

line[1024], *pseq; 
*px, *py; 
natgc, tlen; 



getseq 



if ((^ = fopen(file,"r")) == 0) { 

fprintf(stden:,"%s: can't read %s\n", prog, file); 
exit(l); 

} 

tlen = natgc = 0; 

while (fgets(line, 1024, fp)) { 

if (*line == | | *line == | | *line == '>') 

continue; 
for (px = line; *px != '\n'; px4-+) 

if (isupper(*px) | | islower(*px)) 
tlen++; 

} 

if ((pseq = malloc((unsigned)(tlen+6))) == 0) { 

fprintf(stderr,"%s: mallocQ failed to get %d bytes for %s\n", prog, tlen+6, file); 

exit(l); 

} 

pseq[0] = pseq[l] — pseq[2] = pseq[3] = '\0'; 
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Table 1 (cont^ 

...getseq 

py = pseq + 4; 
*len = tlen; 
rewind(J5>); 

whUe (fgetsOine, 1024, fp)) { 

if(*line== ';' 1| *line || *line 

continue; 

for (px = line; *px != '\n*; px++) { 
if (isupper(*px)) 

*py++ = *px; 
else if (islower(*px)) 

*py+ + = toupper(*px); 
if (index("ATGCU",*(py-l))) 

natgc++; 

} 

} 

*py++ = '\0'; 

*py = '\0'; 
(void) fclose(fjp); 
dna = natgc > (tlen/3); 
return(pseq + 4) ; 

} 

char * 

g_calloc(msg, nx, sz) gjCallOC 
char *msg; /* program, calling routine */ 

int nx, sz; /* number and size of elements */ 

{ 

char *px, *caUoc(); 

if ((px = calloc((unsigned)nx, (iinsigned)sz)) = = 0) { 
if (*msg) { 

^rintf(stderr, "%s: g_callocO failed %s (n=%d, sz=%d)\n", prog, msg, nx, sz); 
exit(l); 

} 

} 

return(px); 

} 

/* 

* get final jmps from dx[ ] or tmp file, set pp[ ], reset dmax: naiain() 
*/ 

readjmpsO 

readjmps 

{ 

int fd = -1; 

int siz, iO, il; 

roister i, j, xx; 

(void) fclose(fj); 

if ((fd = openQname, 0__RDONLY, 0)) < 0) { 

^rintf(stderr, "^s: can't open() %s\n", prog, jname); 
cleanup(l); 

} 

} 

for (i = iO = il = 0, dmaxO = dmax, xx = lenO; ; i+ +) { 
whQe (1) { 

for (j = dx[dmax].ijmp; j > = 0 && dx[dmax].jp.x[j] > = xx; j-) 
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Table 1 (cont^ 

•••readjmps 

if Q < 0 && dx[dmax]. offset && { 

(void) lseek(fd, dx[dmax]. offset, 0); 

(void) read(fd, (char *)&dx[dmax].jp, sizeof(struct jmp)); 

(void) read(fd, (char *)&dx[dmax]. offset, sizeof(dx[dmax]. offset)); 

dx[dmax].ijmp = MAXJMP-1; 

} 



10 break; 
} 

if (i > = JMPS) { 

^rintf(stderr, "%s: too many gaps in alignment\n", prog); 
cleanup(l); 

15 } 

if a > = o){ 

siz = dx[dniax].jp.n[j]; 
XX = dx[dmax].jp.x|j]; 
dmax + = siz; 

20 if (siz < 0) { /* gap in second seq */ 

pp[l].n[il] = -siz; 
XX -H = siz; 

/* id = XX - yy + lenl - 1 
*/ 

25 pp[l].x[il] = XX - dmax + lenl - 1; 

gapy+ + ; 
ngapy -= siz; 
/* ignore MAXGAP when doing endgaps */ 

siz = (-siz < MAXGAP 1 1 endgaps)? -siz : MAXGAP; 
30 iH- + ; 

} 

else if (siz > 0) { /* gap in first seq */ 
pp[0].n[iO] = siz; 
pp[0].x[iO] = xx; 
35 gapxH--!-; 

ngapx += siz; 
/* ignore MAXGAP when doing endgaps */ 

siz = (siz < MAXGAP 1 1 endgaps)? siz : MAXGAP; 
iO+ + ; 

40 } 
} 

else 

break; 

} 

45 

/* reverse the order of jmps 

*/ 

for (j = 0, iO-; j < iO; j + + , iO-) { 

i = pp[0].nD]; pp[0].nD] = pp[0].n[i0]; pp[0].n[i0] = i; 
50 i = pp[0].x|j]; pp[0].xD] = pp[0].x[i0]; pp[0],x[i0] = i; 

> 

for (j = 0, il-; j < il; j + + , il--) { 

i = PP[l].nIj]; pp[l].nO] = pp[l].n[il]; pp[l].n[il] = i; 
i == pp[l].x[j]; pp[l].x|j] = pp[l].x[il]; pp[l].x[il] = i; 

55 } 

if (fd > = 0) 

(void) close(fd); 

if(g){ 

(void) nnlink(jname); 
60 S = 0; 

offset = 0; 
} } 
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Table 1 (cont^ 

/* 

* write a filled jmp struct offset of the prev one (if any): nw() 
5 */ 

writejmps(ix) 

writejmps 

int ix; 

{ 

10 char *mktemp(); 

if (mktemp(jname) < 0) { 

Q)rintf(stderr, "%s: can't mktempO %s\n", prog, jname); 
15 cleaaup(l); 
} 

if ((g = fopen(jname, "w")) == 0) { 

^rintf(stderr, "%s: can't write %s\n", prog, jname); 
exit(l); 

20 } 
} 

(void) fwrite((char *)&dx[ix].jp, sizeof(struct jmp), 1, fl); 
(void) fwrite((char *)&dx[ix]. offset, sizeof(dx[ix]. offset), 1,5); 

} 

25 
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PRO 

Comparison Protein 
% amino acid sequence identity = 



Table 2 

xxxxxxxxxxxxxxx 

XXXXXYYYYYYY 



(Length =15 amino acids) 

(Length = 12 amino acids) 



(the number of identically matching amino acid residues between the two polypeptide sequences as 
10 determined by ALIGN-2) divided by (the total number of amino acid residues of the PRO polypeptide) = 
5 divided by 15 = 33.3% 



15 PRO 

Comparison Protein 
% amino acid sequence identity = 



Table 3 

XXXXXXXXXX 
XXXXXYYYYYYZZYZ 



(Length = 10 amino acids) 

(Length = 15 amino acids) 



(the number of identically matching amino acid residues between the two polypeptide sequences as 
20 determined by ALIGN-2) divided by (the total number of amino acid residues of the PRO polypeptide) 
5 divided by 10 = 50% 



Table 4 



25 PRO-DNA 

Comparison DNA 



NNNNNNNNNNNNNN 
NNNNNNLLLLLLLLLL 



(Length = 14 nucleotides) 
(Length = 16 nucleotides) 



% nucleic acid sequence identity = 

30 (the number of identically matching nucleotides between the two nucleic acid sequences as determined by 
ALIGN-2) divided by (the total number of nucleotides of the PRO-DNA nucleic acid sequence) = 
6 divided by 14 = 42.9% 



35 



PRO-DNA 
Comparison DNA 



Table 5 

NNNNNNNNNNNN 
NNNNLLLVV 



(Length =12 nucleotides) 
(Length = 9 nucleotides) 
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% nucleic acid sequence identity = 

(the number of identically matching nucleotides between tiie two nucleic acid sequences as deteimined by 
ALIGN-2) divided by (the total number of nucleotides of the PRO-DNA nucleic acid sequence) = 
4 divided by 12 = 33.3% 

II. Compositions and Methods of the Invention 

A. Full-Length PRO Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding 
polypeptides referred to in tiie present application as PRO polypeptides. In particular, cDNAs encoding 
various PRO polypeptides have been identified and isolated, as disclosed in further detail in the Examples 
below. However, for sake of simplicity, in the present specification the protein encoded by the full length 
native nucleic acid molecules disclosed herein as well as all further native homologues and variants 
mcluded in the foregoing definition of PRO, will be referred to as "PRO/number", regardless of their 
origin or mode of preparation. 

As disclosed in the Examples below, various cDNA clones have been disclosed. The predicted 
amino acid sequence can be determined from the nucleotide sequence using routine skill. For the PRO 
polypeptides and encoding nucleic acids described herein. Applicants have identified what is believed to be 
the reading frame best identifiable with the sequence information available at the tune. 

B. PRO Polypeptide Variants 

In addition to the fall-length native sequence PRO polypeptides described herein, it is 
contemplated tiiat PRO variants can be prepared, PRO variants can be prepared by introducing 
appropriate nucleotide changes into the PRO DNA, and/or by synthesis of the desired PRO polypeptide. 
Those skilled in the art will appreciate that amino acid changes may alter post-translational processes of 
the PRO, such as changing the number or position of glycosylation sites or altering the membrane 
anchoring characteristics. 

Variations in the native full-length sequence PRO or in various domains of the PRO described 
herein, can be made, for example, using any of the techniques and guidelines for conservative and non- 
conservative mutations set forth, for instance, in U.S. Patent No. 5,364,934. Variations may be a 
substitution, deletion or insertion of one or more codons encodmg the PRO that results in a change in the 
amino acid sequence of the PRO as compared with the native sequence PRO. Optionally, the variation is 
by substitution of at least one amino acid with any other amino acid in one or more of the domains of the 
PRO. Guidance in determining which amino acid residue may be inserted, substituted or deleted without 
adversely affectmg the desired activity may be found by comparmg the sequence of the PRO with that of 
homologous known protein molecules and minimizing the number of ammo acid sequence changes made 
in regions of high homology. Amino acid substitutions can be the result of replacing one amino acid with 
another amino acid having similar structural and/or chemical properties, such as the replacement of a 
leucine with a serine, i.e., conservative amino acid replacements. Insertions or deletions may optionally 
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be in the range of about 1 to 5 amino acids. The variation allowed may be determined by systematically 
making insertions, deletions or substitutions of amino acids in the sequence and testing the resulting 
variants for activity exhibited by the full-length or mature native sequence. 

PRO polypeptide fragments are provided herein. Such fragments may be truncated at the N- 
terminus or C~terminus, or may lack internal residues, for example, when compared with a fall lengtihi 
native protein. Certain fragments lack amino acid residues that are not essential for a desired biological 
activit}^ of the PRO polypeptide. 

PRO fragments may be prepared by any of a number of conventional techniques. Desired peptide 
fragments may be chemically synthesized. An alternative approach involves generating PRO fragments by 
enzymatic digestion, e.g., by treating the protein with an enzyme known to cleave proteins at sites defined 
by particular amino acid residues, or by digesting the DNA with suitable restriction enzymes and isolating 
the desired fragment. Yet another suitable technique involves isolating and amplifying a DNA fragment 
encoding a desured polypeptide fragment, by polymerase chain reaction (PGR). Oligonucleotides that 
define the desired termmi of the DNA fragment are employed at the 5' and 3' primers in the PGR. 
Preferably, PRO polypeptide fragments share at least one biological and/or immunological activity with 
the native PRO polypeptide disclosed herein. 

In particular embodiments, conservative substitutions of iaterest are shown in Table 6 under the 
heading of preferred substitutions. If such substitutions result in a change in biological activity, then more 
substantial changes, denominated exemplary substitutions in Table 6, or as further described below in 
reference to amino acid classes, are introduced and the products screened. 
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Table 6 





Original 


Exemplary 


Preferred 


5 


Residue 


Substitutions 


Substitutions 




Ala (A) 


val; leu; ile 


val 




Arg (R) 


lys; gin; asn 


lys 




Asn (N) 


gin; his; lys; arg 


gin 




Asp (D) 


glu 


glu 


10 


Cys (C) 


ser 


ser 




Gln(Q) 


asn 


asn 




Glu (E) 


asp 


asp 




Gly (G) 


pro; ala 


ala 




His (H) 


asn; gin; lys; arg 


arg 


15 


He a) 


leu; val; met; ala; phe; 








norleucine 


leu 




Leu(L) 


norleucine; ile; val; 








met; ala; phe 


ile 




Lys (K) 


arg; ghi; asn 


arg 


20 


Met (M) 


leu; phe; ile 


leu 




Phe (F) 


leu; val; ile; ala; tyr 


leu 




Pro (P) 


ala 


ala 




Ser (S) 


thr 


thr 




Thr (T) 


ser 


ser 


25 


Trp (W) 


tyr; phe 


tyr 




Tyr (Y) 


trp; phe; thr; ser 


phe 




Val(V) 


ile; leu; met; phe; 








ala; norleucine 


leu 



30 Substantial modifications in function or immunological identity of the PRO polypeptide are 

accomplished by selecting substitutions that differ significantly in their effect on maintaining (a) the 
structure of the polypeptide backbone in the area of the substitution, for example, as a sheet or helical 
conformation, (b) the charge or hydrophobicity of the molecule at the target site, or (c) the bulk of the side 
chain. Naturally occurring residues are divided into groups based on common side-chain properties: 

35 (1) hydrophobic: norleucine, met, ala, val, leu, ile; 

(2) neutral hydrophilic: cys, ser, thr; 

(3) acidic: asp, glu; 

(4) basic: asn, gin, his, lys, arg; 

(5) residues that influence chain orientation: gly, pro; and 
40 (6) aromatic: trp, tyr, phe. 

Non-conservative substitutions will entail exchanging a member of one of these classes for 
another class. Such substituted residues also may be introduced into the conservative substitution sites or, 
more preferably, into the remaining (non-conserved) sites. 

The variations can be made using methods known in the art such as oligonucleotide-mediated 
45 (site-directed) mutagenesis, alanine scanning, and PGR mutagenesis. Site-directed mutagenesis [Carter et 
al., NucL Acids Res., 13:4331 (1986); Zoller et al., Nucl. Acids Res., 10:6487 (1987)], cassette 
mutagenesis [Wells et al., Gene, 34:315 (1985)], restriction selection mutagenesis [Wells et al., Philos. 
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Trans. R. Soc. London SerA, 317:415 (1986)] or other known techniques can be performed on the cloned 
DNA to produce the PRO variant DNA. 

Scanning amino acid analysis can also be employed to identify one or more amino acids along a 
contiguous sequence. Among the preferred scanning amino acids are relatively small, neutral amino 
5 acids. Such amino acids include alanine, glycine, serine, and cysteine. Alanine is typically a preferred 
scanning amino acid among this group because it eliminates the side-chain beyond the beta-carbon and is 
less likely to alter the maia-charn conformation of the variant [Cunningham and Wells, Science, 244: 
1081-1085 (1989)]. Alanine is also typically preferred because it is the most common amino acid. 
Further, it is frequently foimd in both bmried and exposed positions [Creighton, The Proteins, (W.H. 

10 Freeman & Co., N.Y.); Chothia, J. Mol. Biol., 150: 1 (1976)]. If alanine substitution does not yield 
adequate amounts of variant, an isoteric amino acid can be used. 
C. Modifications of PRO 
Covalent modifications of PRO are included within the scope of this invention. One type of 
covalent modification includes reacting targeted amino acid residues of a PRO polypeptide with an organic 

15 derivatizing agent that is capable of reacting with selected side chains or the N- or C- terminal residues of 
the PRO. Derivatization with bifunctional agents is useful, for instance, for crosslinking PRO to a water- 
insoluble support matrix or surface for use in the method for purifying anti-PRO antibodies, and vice- 
versa. Commonly used crosslinking agents include, e.g., l,l-bis(diazoacetyl)-2-phenylethane, 
glutaraldehyde, N-hydroxysuccinimide esters, for example, esters wiHi 4-azidosalicylic acid, 

20 homobifimctional imidoesters, including disuccinimidyl esters such as 3,3'- 
dithiobis(succiQimidylpropionate), bifunctional maleimides such as bis-N-maleimido-l,8-octane and agents 
such as methyl-3-[(p-azidophenyl)dithio]propioimidate. 

Other modifications include deamidation of glutaminyl and asparaginyl residues to the 
corresponding glutamyl and aspartyl residues, respectively, hydroxylation of proliae and lysine, 

25 phosphorylation of hydroxyl groups of seryl or threonyl residues, methylation of the a-amino groups of 
lysine, arginine, and histidine side chains [T.E. Creighton, Proteins: Structure and Molecular Properties, 
W.H. Freeman & Co., San Francisco, pp. 79-86 (1983)], acetylation of the N-terminal amine, and 
amidation of any C-terminal carboxyl group. 

Another type of covalent modification of the PRO polypeptide included within the scope of this 

30 invention comprises altering the native glycosylation pattern of the polypeptide. "Altering the native 
glycosylation pattern" is intended for purposes herein to mean deleting one or more carbohydrate moieties 
found in native sequence PRO (either by removing the underlying glycosylation site or by deleting the 
glycosylation by chemical and/or enzymatic means), and/or adding one or more glycosylation sites that are 
not present in the native sequence PRO. In addition, the phrase includes qualitative changes in the 

35 glycosylation of the native proteins, involving a change in the nature and proportions of the various 
carbohydrate moieties present. 

Addition of glycosylation sites to the PRO polypeptide may be accomplished by altering tlie 
anaino acid sequence. The alteration may be made, for example, by the addition of, or substitution by, 

70 



wo 2004/041170 



PCT/US2003/034312 



one or more serine or threonine residues to the native sequence PRO (for O-linked glycosylation sites). 
The PRO amino acid sequence may optionally be altered through changes at the DNA level, particularly 
by mutating the DNA encoding the PRO polypeptide at preselected bases such that codons are generated 
that will translate into the desired amino acids. 
5 Another means of increasing the number of carbohydrate moieties on the PRO polypeptide is by 

chemical or enzymatic coupling of glycosides to the polypeptide. Such methods are described in the art, 
e.g., in WO 87/05330 published 11 September 1987, and in Aplin and Wriston, CRC Crit. Rev. 
Biochem., pp. 259-306 (1981). 

Removal of carbohydrate moieties present on the PRO polypeptide may be accomplished 

10 chemically or enzymatically or by mutational substitution of codons encoding for amino acid residues that 
serve as targets for glycosylation. Chemical deglycosylation techniques are known in the art and 
described, for instance, by Hakimuddin, et al.. Arch. Biochem. Biophys., 259: 52 (1987) and by Edge et 
al.. Anal. Biochem., 118: 131 (1981). Enzymatic cleavage of carbohydrate moieties on polypeptides can 
be achieved by the use of a variety of endo- and exo-glycosidases as described by Thotakura et al., Meth. 

15 Enzvmol., 138:350 (1987). 

Another type of covalent modification of PRO comprises linking the PRO polypeptide to one of a 
variety of nonproteinaceous polymers, e.g., polyethylene glycol (PEG), polypropylene glycol, or 
polyoxyalkylenes, in the manner set forth in U.S. Patent Nos. 4,640,835; 4,496,689; 4,301,144; 
4,670,417; 4,791,192 or 4,179,337. 

20 The PRO of the present invention may also be modified in a way to form a chimeric molecule 

comprising PRO fused to another, heterologous polypeptide or amino acid sequence. 

In one embodiment, such a chimeric molecule comprises a fusion of the PRO with a tag 
polypeptide which provides an epitope to which an anti-tag antibody can selectively bind. The epitope tag 
is generally placed at the amino- or carboxyl- terminus of the PRO. The presence of such epitope-tagged 

25 forms of the PRO can be detected using an antibody against the tag polypeptide. Also, provision of the 
epitope tag enables the PRO to be readily purified by affinity purification using an anti-tag antibody or 
another type of affinity matrix that binds to the epitope tag. Various tag polypeptides and their respective 
antibodies are well known in the art. Examples include poly-histidine (poly-his) or poly-histidine-glycine 
(poly-his-gly) tags; the flu HA tag polypeptide and its antibody 12CA5 [Field et al., Mol. Cell. Biol., 

30 8:2159-2165 (1988)]; the c-myc tag and the 8F9, 3C7, 6E10, G4, B7 and 9E10 antibodies thereto [Evan et 
al.. Molecular and Cellular Biology. 5:3610-3616 (1985)]; and the Herpes Simplex virus glycoprotein D 
(gD) tag and its antibody [Paborsky et al.. Protein Engineering, 3(6):547-553 (1990)]. Other tag 
polypeptides include the Flag-peptide [Hopp et al., BioTechnoloev. 6:1204-1210 (1988)]; the KT3 epitope 
peptide [Martm et al.. Science, 255: 192-194 (1992)]; an alpha-tubulin epitope peptide [Skinner et al., L 

35 Biol. Chem., 266:15163-15166 (1991)]; and the T7 gene 10 protein peptide tag [Lutz-Freyermuth et al., 
Proc. Natl. Acad. Sci. USA, 87:6393-6397 (1990)]. 

In an alternative embodiment, the chimeric molecule may comprise a fusion of the PRO with an 
immxmoglobulin or a particular region of an immunoglobulin. For a bivalent form of the chimeric 
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molecule (also referred to as an "iinmunoadhesin")> such a fusion could be to the Fc region of an IgG 
molecule. The Ig fusions preferably include the substitution of a soluble (transmembrane domain deleted 
or inactivated) form of a PRO polypeptide in place of at least one variable region within an Ig molecule. 
In a particularly preferred embodiment, the immunoglobulin fusion includes the hinge, CH2 and CHS, or 
5 tlie hinge, CHI, CH2 and CHS regions of an IgGl molecule. For the production of immunoglobulin 
fusions see also US Patent No, 5,428,130 issued June 27, 1995. 
D. Preparation of PRO 

The description below relates primarily to production of PRO by culturing cells transformed or 
transfected with a vector containing PRO nucleic acid. It is, of course, contemplated that alternative 

10 methods, which are well known in the art, may be employed to prepare PRO. For instance, the PRO 
sequence, or portions thereof, may be produced by direct peptide synthesis using solid-phase techniques 
[see, e.g., Stewart et al., Solid-Phase Peptide Svnthesis, W.H. Freeman Co., San Francisco, CA (1969); 
Merrifield, J. Am. Chem. Soc, 85:2149-2154 (1963)]. In vitro protein synthesis may be performed using 
3aianual techniques or by automation. Automated synthesis may be accomplished, for instance, using an 

15 Applied Biosystems Peptide Synthesizer (Foster City, CA) using manufacturer's instructions. Various 
portions of the PRO may be chemically sjoithesized separately and combined using chemical or enzymatic 
methods to produce the fiill-length PRO. 

1. Isolation of DNA Encoding PRO 
DNA encoding PRO may be obtained from a cDNA library prepared from tissue believed to 

20 possess the PRO mRNA and to express it at a detectable level. Accordingly, human PRO DNA can be 
conveniently obtained from a cDNA library prepared from hirnian tissue, such as described in the 
Examples, The PRO-encoding gene may also be obtained from a genomic library or by known synthetic 
procedures (e.g., automated nucleic acid synthesis). 

Libraries can be screened with probes (such as antibodies to the PRO or oligonucleotides of at 

25 least about 20-80 bases) designed to identify the gene of interest or the protein encoded by it. Screening 
the cDNA or genomic library with the selected probe may be conducted using standard procedures, such 
as described in Sambrook et al.. Molecular Cloning: A Laboratorv Manual (New York: Cold Spring 
Harbor Laboratory Press, 1989). An alternative means to isolate the gene encoding PRO is to use PCR 
methodology [Sambrook et al., supra: Dieffenbach et al., PCR Primer: A Laboratorv Manual (Cold 

30 Spring Harbor Laboratory Press, 1995)]. 

The Examples below describe techniques for screening a cDNA library. The oligonucleotide 
sequences selected as probes should be of sufficient length and sufficiently unambiguous that false 
positives are minimized. The oligonucleotide is preferably labeled such that it can be detected upon 
hybridization to DNA in the library being screened. Methods of labeling are well known in the art, and 

35 include the use of radioiabels like ^^P-labeled ATP, biotinylation or enzyme labeling. Hybridization 
conditions, including moderate stringency and high stringency, are provided in Sambrook et al,, supra. 

Sequences identified in such library screening methods can be compared and aligned to other 
known sequences deposited and available in public databases such as GenBank or other private sequence 
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databases. Sequence identity (at either the amino acid or nucleotide level) within defined regions of the 
molecule or across the full-length sequence can be determined using methods known hi the art and as 
described herein. 

Nucleic acid having protein coding sequence may be obtained by screening selected cDNA or 
5 genomic libraries using the deduced amino acid sequence disclosed herein for the first time, and, if 
necessary, using conventional primer extension procedures as described in Sambrook et al., supra, to 
detect precursors and processing intermediates of mRNA that may not have been reverse-transcribed into 
cDNA. 

10 2. Selection and Transformation of Host Cells 

Host cells are transfected or transformed with expression or cloning vectors described herein for 
PRO production and cultured in conventional nutrient media modified as appropriate for inducing 
promoters, selecting transformants, or amplifying the genes encoding the desired sequences. The culture 
conditions, such as media, temperature, pH and the like, can be selected by the skilled artisan witiiout 

15 xmdue experimentation. In general, principles, protocols, and practical techniques for maximizing the 
productivity of cell cultures can be found in Mammalian Cell Biotechnologv: a Practical Approach, M. 
Butler, ed. (IRL Press, 1991) and Sambrook et al., supra. 

Methods of eukaryotic cell transfection and prokaryotic cell transformation are known to the 
ordinarily skilled artisan, for example, CaCl2, CaP04, liposome-mediated and electroporation. Depending 

20 on the host cell used, transformation is performed using standard techniques appropriate to such cells. 
The calcium treatment employing calcium chloride, as described in Sambrook et al., supra, or 
electroporation is generally used for prokaryotes. Infection with Agrobacterium tumefaciens is used for 
transformation of certain plant cells, as described by Shaw et al.. Gene, 23:315 (1983) and WO 89/05859 
published 29 June 1989. For mammalian cells without such cell walls, the calcium phosphate precipitation 

25 method of Graham and van der Eb, Virology, 52:456-457 (1978) can be employed. General aspects of 
mammalian cell host system transfections have been described in U.S. Patent No. 4,399,216. 
Transformations into yeast are typically carried out according to the method of Van Solingen et al., JL 
' Bact., 130:946 (1977) and Hsiao et al., Proc. Natl. Acad. Sci. (USA). 76:3829 (1979). However, other 
methods for introducing DNA into cells, such as by nuclear microinjection, electroporation, bacterial 

30 protoplast fusion with intact cells, or polycations, e.g., polybrene, polyomithine, may also be used. For 
various techniques for transforming mammalian cells, see Keown et al., Methods in Enzvmology, 
185:527-537 (1990) and Mansour et al.. Nature. 336:348-352 (1988). 

Suitable host cells for cloning or expressing the DNA in the vectors herein include prokaryote, 
yeast, or higher eukaryote cells. Suitable prokaryotes include but are not limited to eubacteria, such as 

35 Gram-negative or Gram-positive organisms, for example, Enterobacteriaceae such as E. coli. Various E. 
coli strains are publicly available, such as E. coli K12 strain MM294 (ATCC 31,446); E, coli X1776 
(ATCC 31,537); E. coli strain W3110 (ATCC 27,325) and K5 772 (ATCC 53,635). Other suitable 
prokaryotic host cells include Enterobacteriaceae such as Escherichia, e.g., E. coli, Enterobacter, 

73 



wo 2004/041170 



PCT/US2003/034312 



Erwinia, Klebsiella, Proteus, Salmonella, e.g.. Salmonella typhimurium, Serratia, e.g., Serratia 
marcescans, and Shigella, as well as Bacilli such as B. subtilis and B. licheniformis (e.g., B, licheniformis 
41P disclosed in DD 266,710 published 12 April 1989), Pseudomonas such as P. aeruginosa, and 
Streptomyces. These examples are illustrative rather than limiting. Strain W3110 is one particularly 
5 preferred host or parent host because it is a common host stram for recombinant DNA product 
fermentations. Preferably, the host cell secretes minimal amounts of proteolytic enzymes. For example, 
strain W3110 may be modified to effect a genetic mutation in the genes encoding proteuis endogenous to 
the host, with examples of such hosts iacluding E. colt W3110 straia 1A2, which has the complete 
genotype touA ; E, coli W3110 strain 9E4, which has the complete genotype tonA ptr3; E. coli W3110 

10 strain 27C7 (ATCC 55,244), which has the complete genotype tonA ptr3 phoA E15 (argF-lac)169 degP 
ompT kan''; E. coli W3110 strain 37D6, which has the complete genotype tonA ptr3 phoA E15 (argF- 
lac)169 degP ompT rbs7 ilvG kan''\ E. coli W3110 strain 40B4, which is strain 37D6 with a non- 
kanamycin resistant degP deletion mutation; and an E, coli strain having mutant periplasmic protease 
disclosed in U.S. Patent No. 4,946,783 issued 7 August 1990. Alternatively, in vitro methods of cloning, 

15 e.g., PGR or other nucleic acid polymerase reactions, are suitable. 

In addition to prokaryotes, eukaryotic microbes such as filamentous fiingi or yeast are suitable 
cloning or expression hosts for PRO-encoding vectors. Saccharomyces cerevisiae is a commonly used 
lower eukaryotic host microorganism. Others include Schizosaccharomyces pombe (Beach and Nurse, 
Nature, 290: 140 [1981]; EP 139,383 published 2 May 1985); Kluyveromyces hosts (U.S. Patent No. 

20 4,943,529; Fleer et al., Bio/Technology, 9:968-975 (1991)) such as, e.g., K. lactis (MW98-8C, CBS683, 
CBS4574; Louvencourt et al., J. BacterioL. 154(2): 737-742 [1983]), K, fragilis (ATCC 12,424), K, 
bulgaricus (ATCC 16,045), K, wickeramii (ATCC 24,178), K. waltii (ATCC 56,500), K. drosophilamm 
(ATCC 36,906; Van den Berg et al., Bio/Technology. 8:135 (1990)), K, thermotolerans, and 
marxianus; yarrowia (EP 402,226); Pichia pastoris (EP 183,070; Sreekrishna et al., J. Basic Microbiol., 

25 28:265-278 [1988]); Candida; Trichodenna reesia (EP 244,234); Neurospora crassa (Case et al., Proc. 
Natl. Acad. Sci. USA. 76:5259-5263 [1979]); Schwanniomyces such as Sckwanniomyces occidentalis (EP 
394,538 published 31 October 1990); and filamentous fungi such as, e.g., Neurospora, Penicillium, 
Tolypocladium (WO 91/00357 published 10 January 1991), and Aspergillus hosts such as A, nidulans 
(Ballance et al., Biochem. Biophvs. Res. Commun.. 112:284-289 [1983]; Tilbum et al.. Gene, 26:205- 

30 221 [1983]; Yelton et al., Proc. Natl. Acad. Sci. USA. 81: 1470-1474 [1984]) and A. niger (KeUy and 
Hynes, EMBO J., 4:475-479 [1985]). Methylotropic yeasts are suitable herem and include, but are not 
limited to, yeast capable of growth on methanol selected from the genera consisting of Hansenula, 
Candida, Kloeckera, Pichia, Saccharomyces, Torulopsis, and Rhodotorula. A list of specific species that 
are exemplary of this class of yeasts may be found in C. Anthony, The Biochemistrv of Metfavlotrophs. 

35 269 (1982). 

Suitable host cells for the expression of glycosylated PRO are derived from multicellular 
organisms. Examples of invertebrate cells include insect cells such as Drosophila S2 and Spodoptera Sf9, 
as well as plant cells. Examples of useful mammalian host cell lines include Chinese hamster ovary 
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(CHO) and COS cells. More specific examples include monkey kidney CVl line transformed by SV40 
(COS-7, ATCC CRL 1651); human embryonic kidney line (293 or 293 cells subcloned for growth in 
suspension culture, Graham et al., J. Gen Virol., 36:59 (1977)); Chinese hamster ovary cells/-DHFR 
(CHO, Urlaub and Chasin, Proc. Natl. Acad. Sci. USA. 77:4216 (1980)); mouse Sertoli cells (TM4, 
5 Mather, Biol. Reprod.. 23:243-251 (1980)); human lung cells (W138, ATCC CCL 75); human liver cells 
(Hep G2, HB 8065); and mouse mammary tumor (MMT 060562, ATCC CCL51). The selection of the 
appropriate host cell is deemed to be within the skill in tlae art. 

3. Selection and Use of a Replicable Vector 

The nucleic acid (e.g., cDNA or genomic DNA) encoding PRO may be inserted into a replicable 

10 vector for cloning (amplification of the DNA) or for expression. Various vectors are publicly available. 
The vector may, for example, be in the form of a plasmid, cosmid, viral particle, or phage. The 
appropriate nucleic acid sequence may be inserted into the vector by a variety of procedures. In general, 
DNA is inserted into an appropriate restriction endonuclease site(s) using techniques known in the art. 
Vector components generally include, but are not limited to, one or more of a signal sequence, an origin 

15 of replication, one or more marker genes, an enhancer element, a promoter, and a transcription 
termination sequence. Construction of suitable vectors containing one or more of these components 
employs standard ligation techniques which are known to the skilled artisan. 

The PRO may be produced recombinantly not only directly, but also as a ftision polypeptide with 
a heterologous polypeptide, which may be a signal sequence or other polypeptide having a specijBc 

20 cleavage site at the N-terminus of the mature protein or polypeptide. In general, the signal sequence may 
be a component of the vector, or it may be a part of the PRO-encoding DNA that is inserted into the 
vector. The signal sequence may be a prokaryotic signal sequence selected, for example, from the group 
of the alkaline phosphatase, penicillinase, Ipp, or heat-stable enterotoxin II leaders. For yeast secretion 
the signal sequence may be, e.g., the yeast invertase leader, alpha factor leader (including Saccharomyces 

25 and Kluyveromyces a-factor leaders, the latter described in U.S. Patent No. 5,010,182), or acid 
phosphatase leader, the C. albicans glucoamylase leader (EP 362,179 published 4 April 1990), or the 
signal described in WO 90/13646 published 15 November 1990. In manmialian cell expression, 
mammalian signal sequences may be used to direct secretion of the protein, such as signal sequences from 
secreted polypeptides of the same or related species, as well as viral secretory leaders, 

30 Both expression and cloning vectors contain a nucleic acid sequence that enables the vector to 

replicate in one or more selected host cells. Such sequences are well known for a variety of bacteria, 
yeast, and viruses. The origin of replication from the plasmid pBR322 is suitable for most Gram-negative 
bacteria, the 2ft plasmid origin is suitable for yeast, and various viral origins (SV40, polyoma, adenovirus, 
VSV or BPV) are useful for cloning vectors in mammalian cells. 

35 Expression and cloning vectors will t3rpically contain a selection gene, also termed a selectable 

marker. Typical selection genes encode proteins that (a) confer resistance to antibiotics or other toxins, 
e.g., ampicillin, neomycin, methotrexate, or tetracycline, (b) complement auxotrophic deficiencies, or (c) 
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supply critical nutrients not available from complex media, e.g., the gene encoding D-alanine racemase 
for Bacilli, 

An example of suitable selectable markers for mammalian cells are those that enable the 
identification of cells competent to take up the PRO-encoding nucleic acid, such as DHFR or thymidine 
5 kinase. An appropriate host cell when wild-type DHFR is employed is the CHO cell line deficient in 
DHFR activity, prepared and propagated as described by Urlaub et al., Proc. Natl. Acad. Sci. USA, 
77:4216 (1980). A suitable selection gene for use in yeast is the trpl gene present in the yeast plasmid 
YRp7 [Stinchcomb et al.. Nature. 282:39 (1979); IGngsman et al.. Gene, 7:141 (1979); Tschemper et al.. 
Gene, 10:157 (1980)]. The trpl gene provides a selection marker for a mutant strain of yeast lacking the 

10 ability to grow in tryptophan, for example, ATCC No. 44076 or PEP4-1 [Jones, Genetics, 85:12 (1977)]. 

Expression and cloning vectors usually contain a promoter operably linked to the PRO-encoding 
nucleic acid sequence to direct mRNA synthesis. Promoters recognized by a variety of potential host cells 
are well known. Promoters suitable for use with prokaryotic hosts include the P-lactamase and lactose 
promoter systems [Chang et al.. Nature. 275:615 (1978); Goeddel et al., Nature, 281:544 (1979)], 

15 alkaline phosphatase, a tryptophan (trp) promoter system [Goeddel, Nucleic Acids Res., 8:4057 (1980); 
EP 36,776], and hybrid promoters such as the tac promoter [deBoer et al., Proc. Natl. Acad. Sci. USA. 
80:21-25 (1983)]. Promoters for use in bacterial systems also will contain a Shine-Dalgamo (S.D.) 
sequence operably linked to the DNA encoding PRO. 

Examples of suitable promoting sequences for use with yeast hosts include the promoters for 3- 

20 phosphoglycerate kinase [Hitzeman et al., J. Biol. Chem., 255:2073 (1980)] or other glycolytic enzymes 
[Hess et al., J. Adv. Enzyme Reg.. 7:149 (1968); Holland, Biochemistry, 17:4900 (1978)], such as 
enolase, glyceraldehyde-3 -phosphate dehydrogenase, hexokinase, pyruvate decarboxylase, 
phosphofructokinase, glucose-6-phosphate isomerase, 3 -phosphoglycerate mutase, pyruvate kinase, 
triosephosphate isomerase, phosphoglucose isomerase, and glucokinase. 

25 Other yeast promoters, which are inducible promoters having the additional advantage of 

transcription controlled by growth conditions, are the promoter regions for alcohol dehydrogenase 2, 
isocytochrome C, acid phosphatase, degradative enzymes associated with nitrogen metabolism, 
metallothionein, glyceraldehyde-3-phosphate dehydrogenase, and enzymes responsible for maltose and 
galactose utilization. Suitable vectors and promoters for use in yeast expression are further described in 

30 EP 73,657. 

PRO transcription from vectors in mammalian host cells is controlled, for example, by promoters 
obtained from the genomes of viruses such as polyoma virus, fowlpox virus (UK 2,211,504 published 5 
July 1989), adenovirus (such as Adenovirus 2), bovine papiUoma virus, avian sarcoma virus, 
cytomegalovirus, a retrovirus, hepatitis-B virus and Simian Virus 40 (SV40), from heterologous 
35 manmialian promoters, e.g., the actin promoter or an immunoglobulin promoter, and from heat-shock 
promoters, provided such promoters are compatible with the host cell systems. 

Transcription of a DNA encoding the PRO by higher eukaryotes may be increased by inserting an 
enhancer sequence into tlae vector. Enhancers are cis-acting elements of DNA, usually about from 10 to 
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300 bp, that act on a promoter to increase its transcription. Many enhancer sequences are now known 
from mammalian genes (globin, elastase, albumin, a-fetoprotein, and insulin). Typically, however, one 
will use an enliancer from a eukaryotic cell virus. Examples include the SV40 enhancer on the late side of 
the replication origin (bp 100-270), the cytomegalovirus early promoter enhancer, the polyoma enhancer 
5 on the late side of the replication origin, and adenovirus enhancers. The enhancer may be spliced into the 
vector at a position 5* or 3' to the PRO coding sequence, but is preferably located at a site 5' from the 
promoter. 

Expression vectors used in eukaryotic host cells (yeast, fangi, insect, plant, animal, human, or 
nucleated cells from other multicellular organisms) will also contain sequences necessary for the 
10 termination of transcription and for stabilizing the mRNA. Such sequences are conunonly available from 
the 5' and, occasionally 3', untranslated regions of eukaryotic or viral DNAs or cDNAs. These regions 
contain nucleotide segments transcribed as polyadenylated fragments in the untranslated portion of the 
mRNA encoding PRO. 

Still other methods, vectors, and host cells suitable for adaptation to the synthesis of PRO in 
15 recombinant vertebrate cell culture are described m Gething et al.. Nature. 293:620-625 (1981); Mantei et 
al.. Nature. 281:40-46 (1979); EP 117,060; and EP 117,058. 

4. Detecting Gene Amplification/Expression 

Gene amplification and/or expression may be measured in a sample directly, for example, by 
conventional Southern blotting. Northern blotting to quantitate the transcription of mRNA [Thomas, Proc. 

20 Natl. Acad. Sci. USA. 77:5201-5205 (1980)], dot blotting (DNA analysis), or in situ hybridization, using 
an appropriately labeled probe, based on the sequences provided herein. Alternatively, antibodies may be 
employed that can recognize specific duplexes, including DNA duplexes, RNA duplexes, and DNA-RNA 
hybrid duplexes or DNA-protein duplexes. The antibodies in turn may be labeled and the assay may be 
carried out where the duplex is bound to a surface, so that upon the formation of duplex on the surface, 

25 the presence of antibody bound to the duplex can be detected. 

Gene ejqpression, alternatively, may be measured by inmiunological methods, such as 
inamunohistochemical staining of cells or tissue sections and assay of cell culture or body fluids, to 
quantitate directly the expression of gene product. Antibodies useful for immunohistochemical staining 
and/or assay of sample fluids may be either monoclonal or polyclonal, and may be prepared in any 

30 mammal. Conveniently, the antibodies may be prepared against a native sequence PRO polypeptide or 
against a synthetic peptide based on the DNA sequences provided herein or against exogenous sequence 
fused to PRO DNA and encoding a specific antibody epitope. 

5. Purification of Polypeptide 

Forms of PRO may be recovered from culture medium or from host cell ly sates. If membrane- 
35 boimd, it can be released from the membrane using a suitable detergent solution (e.g. Triton-X 100) or by 
enzymatic cleavage. Cells employed in expression of PRO can be disrupted by various physical or 
chemical means, such as freeze-thaw cycling, sonication, mechanical disruption, or cell lysing agents. 
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It may be desired to purify PRO from recombinant cell proteins or polypeptides. The following 
procedmres are exemplary of suitable purification procedures: by fractionation on an ion-exchange column; 
ethanol precipitation; reverse phase HPLC; chromatography on silica or on a cation-exchange resin such 
as DEAE; chromatofocusing; SDS-PAGE; ammonium sulfate precipitation; gel filtration using, for 
5 example, Sephadex G-75; protein A Sepharose columns to remove contaminants such as IgG; and metal 
chelating columns to bind epitope-tagged forms of the PRO. Various methods of protein purification may 
be employed and such methods are known in the art and described for example in Deutscher, Methods in 
Enzvmology, 182 (1990); Scopes, Protein Purification: Principles and Practice, Springer- Verlag, New 
York (1982). The purification step(s) selected will depend, for example, on the nature of the production 
10 process used and the particular PRO produced. 

E. Tissue Distribution 

The location of tissues expressing the PRO can be identified by determining mRNA expression in 
various human tissues. The location of such genes provides information about which tissues are most 
likely to be affected by the stimulating and inhibiting activities of the PRO polypeptides. The location of 
15 a gene in a specific tissue also provides sample tissue for the activity blocking assays discussed below. 

As noted before, gene expression in various tissues may be measured by conventional Southern 
blotting. Northern blotting to quantitate the transcription of mRNA (Thomas, Proc. NatL Acad, Set USA, 
77:5201-5205 [1980]), dot blotting (DNA analysis), or in situ hybridization, using an appropriately labeled 
probe, based on the sequences provided herein. Alternatively, antibodies may be employed that can 
20 recognize specific duplexes, including DNA duplexes, RNA duplexes, and DNA-RNA hybrid duplexes or 
DNA-protein duplexes. 

Gene expression in various tissues, alternatively, may be measured by immunological methods, 
such as immunohistochemical staining of tissue sections and assay of cell culture or body fluids, to 
quantitate directly the expression of gene product. Antibodies useful for immunohistochemical staining 

25 and/or assay of sample fluids may be either monoclonal or polyclonal, and may be prepared in any 
mammal. Conveniendy, the antibodies may be prepared against a native sequence of a PRO polypeptide 
or against a synthetic peptide based on the DNA sequences encoding the PRO polypeptide or against an 
exogenous sequence fused to a DNA encoding a PRO polypeptide and encoding a specific antibody 
epitope. General techniques for generating antibodies, and special protocols for Northern blotting and in 

30 situ hybridization are provided below. 

F. Antibodv Binding Studies 

The activity of the PRO polypeptides can be further verified by antibody binding studies, in 
which the ability of anti-PRO antibodies to inhibit the effect of the PRO polypeptides, respectively, on 
tissue cells is tested. Exemplary antibodies include polyclonal, monoclonal, humanized, bispecific, and 
35 heteroconjugate antibodies, the preparation of which will be described hereinbelow. 

Antibody binding studies may be carried out in any known assay method, such as competitive 
binding assays, direct and indirect sandwich assays, and immunoprecipitation assays. Zola, Monoclonal 
Antibodies: A Manual of Techniques, pp. 147-158 (CRC Press, Inc., 1987). 
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Competitive binding assays rely on the ability of a labeled standard to compete with the test 
sample analyte for binding with a limited amount of antibody. The amount of target protein in the test 
sample is inversely proportional to the amount of standard that becomes bound to the antibodies. To 
facilitate determining the amount of standard that becomes bound, the antibodies preferably are 
5 insolubilized before or after the competition, so that the standard and analyte that are bound to the 
antibodies may conveniently be separated from tlie standard and analyte which remain unbound. 

Sandwich assays involve the use of two antibodies, each capable of binding to a different 
immxmogenic portion, or epitope, of the protein to be detected. In a sandwich assay, the test sample 
analyte is bound by a first antibody which is immobilized on a solid support, and thereafter a second 
10 antibody binds to the analyte, thus forming an insoluble three-part complex. See, e.g., US Pat No. 
4,376,110. The second antibody may itself be labeled with a detectable moiety (direct sandwich assays) or 
may be measured using an anti-immunoglobulin antibody that is labeled with a detectable moiety (indirect 
sandwich assay). For example, one type of sandwich assay is an ELISA assay, in which case the 
detectable moiety is an enzyme. 
15 For inmiunohistochemistry, the tissue sample may be fresh or frozen or may be embedded in 

paraffin and fixed with a preservative such as formalin, for example. 

G. Cell-Based Assays 

Cell-based assays and animal models for immune related diseases can be used to further 
understand the relationship between the genes and polypeptides identified herein and the development and 

20 pathogenesis of immime related disease. 

In a different approach, cells of a cell type known to be involved in a particular immune related 
disease are transfected with the cDNAs described herein, and the ability of these cDNAs to stimulate or 
inhibit immune function is analyzed. Suitable cells can be transfected with the desired gene, and 
monitored for immune function activity. Such transfected cell lines can then be used to test the ability of 

25 poly- or monoclonal antibodies or antibody compositions to inhibit or stimulate immune function, for 
example to modulate monoc3^e/macrophage proliferation or inflammatory cell infiltration. Cells 
transfected with the coding sequences of the genes identified herein can further be used to identify drug 
candidates for the treatment of immune related diseases. 

In addition, primary cultures derived from transgenic animals (as described below) can be used in 

30 the cell-based assays herein, although stable cell lines are preferred. Techniques to derive continuous cell 
lines from transgenic animals are well known in the art (see, e.g.. Small et ah, MoL Cell. Biol. 5: 642- 
648 [1985]). 

The use of an agonist stimulating compound has also been validated experimentally. Activation 
of 4-lBB by treatment with an agonist anti-4-lBB antibody enhances eradication of tumors. Hellstrom, 1. 
35 and Hellstrom, K. E., CrlL Rev. ImmwioL (1998) 18:1. Inmiunoadjuvant therapy for treatment of 
tumors, described in more detail below, is another example of the use of the stimulating compoimds of 
the invention. 
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Alternatively, an umnune stimulating or enhancing effect can also be achieved by administration 
of a PRO which has vascular permeability enhancing properties. Enhanced vascular permeability would 
be beneficial to disorders which can be attenuated by local infiltration of immune cells (e,g,, 
monocytes/macrophages, eosinophils, PMNs) and inflammation. 
5 On the other hand, PRO polypeptides, as well as other compounds of the invention, which are 

direct inhibitors of monocj^e/macrophage proliferation/activation, lymphokine secretion, and/or vascular 
permeability can be directly used to suppress the immune response. These compounds are useful to 
reduce the degree of the immune response and to treat inunime related diseases characterized by a 
hyperactive, superoptimal, or autoimmune response. The use of compoimd which suppress vascular 
10 permeability would be expected to reduce inflammation. Such uses would be beneficial in treating 
conditions associated with excessive inflammation. 

Alternatively, compounds, e.g., antibodies, which bind to stimulating PRO polypeptides and 
block the stimulating effect of these molecules produce a net inhibitory effect and can be used to suppress 
the monocyte/macrophage mediated immime response by inhibiting monoc)^e/macrophage 
15 proliferation/activation and/or lymphokine secretion. Blocking the stimulating effect of the polypeptides 
suppresses the immune response of the mammal. 

H. Animal Models 

The results of the cell based in vitro assays can be further verilBied using in vivo animal models 
and assays for monocyte/macrophage function. A variety of well known animal models can be used to 

20 further understand the role of the genes identified herein in the development and pathogenesis of immune 
related disease, and to test the efficacy of candidate therapeutic agents, including antibodies, and other 
antagonists of the native polypeptides, including small molecule antagonists. The in vivo nature of such 
models makes them predictive of responses in human patients. Animal models of immune related diseases 
include both non-recombinant and recombinant (transgenic) animals. Non-recombinant animal models 

25 include, for example, rodent, e.g., murine models. Such models can be generated by introducing cells 
into syngeneic mice using standard techniques, e.g., subcutaneous injection, tail vein injection, spleen 
implantation, intraperitoneal implantation, implantation under the renal capsule, etc. 

Graft-versus-host disease occurs when immunocompetent cells are transplanted into 
immunosuppressed or tolerant patients. The donor cells recognize and respond to host antigens. The 

30 response can vary from life threatening severe inflanmiation to mild cases of diarrhea and weight loss. 
Graft-versus-host disease models provide a means of assessmg monocyte/macrophage reactivity against 
MHC antigens and minor transplant antigens. A suitable procedure is described in detail in Current 
Protocols in Immunology, above, unit 4.3. 

Animal models for delayed type hypersensitivity provides an assay of cell mediated immune 

35 function as well. In chronic Delayed type hypersensitivity (DTH) reactions, monoc5^es that have 
differentiated into macrophages lead to the destruction of host tissue which is replaced by fibrous tissue 
(fibrosis). 
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Contact hypersensitivity is a simple delayed type hypersensitivity in vivo assay of cell mediated 
immune function. In this procedure, cutaneous exposure to exogenous haptens which gives rise to a 
delayed type hypersensitivity reaction which is measured and quantitated. Contact sensitivity involves an 
initial sensitizing phase followed by an elicitation phase. The elicitation phase occurs when the T 
5 lymphocj^es encounter an antigen to which they have had previous contact. Swelling and inflammation 
occur, making this an excellent model of human allergic contact dermatitis. At this point, monoc5^es 
leave the blood and differentiate in to macrophages. A suitable procedure is described in detail in Current 
Protocols in Immunology, Eds. J. E. Cologan, A. M. Kruisbeek, D. H, Maigulies, E. M. Shevach and 
W. Strober, John Wiley & Sons, Inc., 1994, unit 4.2. See also Grabbe, S. and Schwarz, T, Immun, 

10 Today 19 (1): 37-44 (1998) 

Recombioant (transgenic) animal models can be engineered by introducing the coding portion of 
the genes identified herein into the genome of animals of interest, using standard techniques for producing 
transgenic animals. Animals that can serve as a target for transgenic manipulation include, without 
limitation, mice, rats, rabbits, guinea pigs, sheep, goats, pigs, and non-human primates, e.g., baboons, 

15 chimpanzees and monkeys. Techniques known in the art to introduce a transgene into such animals 
include pronucleic microinjection (Hoppe and Wanger, U.S. Patent No. 4,873,191); retrovirus-mediated 
gene transfer into germ lines (e.g.. Van der Putten et at., Proc. Natl. Acad. Set USA 82, 6148-615 
[1985]); gene targeting in embryonic stem cells (Thompson et al.. Cell 56, 313-321 [1989]); 
electroporation of embryos (Lo, Mol Cel. Biol. 3, 1803-1814 [1983]); sperm-mediated gene transfer 

20 (Lavitrano et aL, Cell 57, 717-73 [1989]). For review, see, for example, U.S. Patent No. 4,736,866. 

For the purpose of the present invention, transgenic animals include those that carry the transgene 
only in part of their cells ("mosaic animals"). The transgene can be integrated either as a single transgene, 
or in concatamers, e.g., head-to-head or head-to-tail tandems. Selective introduction of a transgene into a 
particular cell type is also possible by following, for example, the technique of Lasko et al., Proc. Natl. 

25 Acad. Sci. USA 89, 6232-636 (1992). 

The expression of the transgene in transgenic animals can be monitored by standard techniques. 
For example, Southern blot ajoalysis or PCR amplification can be used to verify the integration of the 
transgene. The level of mRNA expression can then be analyzed using techniques such as in situ 
hybridization. Northern blot analysis, PCR, or immunocytochemistry. 

30 The animals may be further examined for signs of immune disease pathology, for example by 

histological examination to determine infiltration of immune ceUs into specific tissues. Blocking 
experiments can also be performed in which the transgenic animals are treated with the compounds of the 
invention to determine the extent of the monoc3^es/macrophage proliferation stimulation or inhibition of 
the compounds. In these experiments, blocking antibodies which bmd to the PRO polypeptide, prepared 

35 as described above, are administered to the animal and the effect on immune function is determined. 

Alternatively, "knock out" animals can be constructed which have a defective or altered gene 
encoding a polypeptide identified herein, as a result of homologous recombination between the endogenous 
gene encoding the polypeptide and altered genomic DNA encoding the same polypeptide introduced into 
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an embryonic cell of the animal. For example, cDNA encoding a particular polypeptide can be used to 
clone genomic DNA encoding that polypeptide in accordance with established techniques. A portion of 
the genomic DNA encoding a particular polypeptide can be deleted or replaced with another gene, such as 
a gene encoding a selectable marker which can be used to monitor integration. Typically, several 
5 kilobases of unaltered flanking DNA (both at the 5* and 3' ends) are included in the vector [see e>g., 
Thomas and Capecchi, Cell, 51.: 503 (1987) for a description of homologous recombination vectors]. The 
vector is introduced into an embryonic stem cell liue (e.g., by electroporation) and cells in which the 
introduced DNA has homologously recombiaed with the endogenous DNA are selected [see e,g,, Li et aL, 
Cell, 69:915 (1992)]. The selected cells are then injected into a blastocyst of an animal (e.g., a mouse or 

10 rat) to form aggregation chimeras [see e.g., Bradley, in Teratocarcinornas and Embryonic Stem Cells: A 
Practical Approach, E. J. Robertson, ed. (IRL, Oxford, 1987), pp. 113-152]. A chimeric embryo can 
then be implanted into a suitable pseudopregnant female foster animal and the embryo brought to term to 
create a "knock out" animal. Progeny harboring the homologously recombined DNA in their germ cells 
can be identified by standard techniques and used to breed animals in which all cells of the animal contain 

15 the homologously recombined DNA. Knockout animals can be characterized for instance, for their ability 
to defend against certain pathological conditions and for their development of pathological conditions due 
to absence of the polypeptide. 

I. ImmunoAdjuvant Therapy 

In one embodiment, the immxmostimulating compounds of tlie invention can be used in 

20 immunoadjuvant therapy for the treatment of tumors (cancer). It is now well established that 
monocytes/macrophages recognize himian tumor specific antigens. One group of tumor antigens, encoded 
by the MAGE, BAGE and GAGE families of genes, are silent in all adult normal tissues , but are 
expressed in significant amounts in tumors, such as melanomas, lung tumors, head and neck tumors, and 
bladder carcinomas. DeSmet, C. et aL, (1996) Proc. Natl, Acad. Sci. USA, 93:7149, It has been shown 

25 that stimulation of immune cells induces tumor regression and an antitumor response both in vitro and in 
vivo, Melero, I. et al.. Nature Medicine (1997) 3:682; Kwon, E. D. et aL, Proc. Natl. Acad. Sci. USA 
(1997) 94: 8099; Lynch, D. H. et al. Nature Medicine (1997) 3:625; Finn, O. J. and Lotze, M. T., J. 
Immunol. (1998) 21:114. The stimulatory compounds of the invention can be administered as adjuvants, 
alone or together with a growth regulating agent, cytotoxic agent or chemotherapeutic agent, to stimulate 

30 monocyte/macrophage proliferation/activation and an antitumor response to tumor antigens. The growth 
regulating, cytotoxic, or chemotherapeutic agent may be administered in conventional amounts using 
known administration regimes. Immunostimulating activity by the compounds of the invention allows 
reduced amounts of the growth regulating, cytotoxic, or chemotherapeutic agents thereby potentially 
lowering the toxicity to the patient. 

35 J. Screening Assays for Drug Candidates 

Screening assays for drug candidates are designed to identify compounds that bind to or complex 
with the polypeptides encoded by the genes identified herein or a biologically active fragment thereof, or 
otherwise interfere with the interaction of the encoded polypeptides with other cellular proteins. Such 
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screening assays will include assays amenable to high-throughput screening of chemical libraries, making 
them particularly suitable for identifying small molecule drug candidates. Small molecules contemplated 
include synthetic organic or inorganic compounds, including peptides, preferably soluble peptides, 
(poly)peptide-immunoglobulin fusions, and, in particular, antibodies including, without limitation, poly- 
and monoclonal antibodies and antibody fragments, single-chain antibodies, anti-idiotypic antibodies, and 
chimeric or hrnnanized versions of such antibodies or fragments, as well as human antibodies and antibody 
fragments. The assays can be performed in a variety of formats, including protein-protein binding assays, 
biochemical screening assays, immunoassays and cell based assays, which are well characterized in the 
art. All assays are common in tliat they call for contacting the drug candidate with a polypeptide encoded 
by a nucleic acid identified herein under conditions and for a time sufficient to allow these two 
components to interact. 

In bindmg assays, the mteraction is bmding and the complex formed can be isolated or detected in 
the reaction mixture. In a particular embodiment, the polypeptide encoded by the gene identified herein or 
the drug candidate is immobilized on a solid phase, e.g., on a ndcrotiter plate, by covalent or non-covalent 
attachments. Non-covalent attachment generally is accomplished by coating the solid surface with a 
solution of the polypeptide and drying. Alternatively, an inmaobilized antibody, e.g., a monoclonal 
antibody, specific for the polypeptide to be immobilized can be used to anchor it to a solid surface. The 
assay is performed by adding the non-immobilized component, which may be labeled by a detectable 
label, to the immobilized component, e.g,, the coated surface containing the anchored component. When 
the reaction is complete, the non-reacted components are removed, e.g., by washing, and complexes 
anchored on the solid surface are detected. When the originally non-immobilized component carries a 
detectable label, the detection of label immobilized on the surface indicates that complexing occurred. 
Where the originally non-immobilized component does not carry a label, complexing can be detected, for 
example, by using a labelled antibody specifically binding the immobilized complex. 

If the candidate compoimd interacts with but does not bind to a particular protein encoded by a 
gene identified herein, its interaction with that protein can be assayed by methods well known for detecting 
protein-protein interactions. Such assays include traditional approaches, such as, cross-linking, co- 
inmnmoprecipitation, and co-purification through gradients or chromatographic columns. In addition, 
protein-protein interactions can be monitored by using a yeast-based genetic system described by Fields 
and co-workers [Fields and Song, Nature (London) 34Q, 245-246 (1989); Chien et aL, Proc. Natl Acad, 
ScL USA 88, 9578-9582 (1991)] as disclosed by Chevray and Nathans, Proc. Natl Acad. ScL USA 89, 
5789-5793 (1991). Many transcriptional activators, such as yeast GAL4, consist of two physically 
discrete modular domains, one acting as the DNA-binding domain, while the other one functioning as the 
transcription activation domain. The yeast expression system described in the foregoing publications 
(generally referred to as the "two-hybrid system") takes advantage of this property, and employs two 
hybrid proteins, one in which the target protein is fused to the DNA-binding domam of GAL4, and 
another, in which candidate activating proteins are fused to the activation domain. The expression of a 
GALl-lacZ reporter gene under control of a GAL4-activated promoter depends on reconstitution of GAL4 
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activity via protein-protein interaction. Colonies containing interacting polypeptides are detected with a 
chromogenic substrate for P-galactosidase. A complete kit (MATCHMAKER'^^) for identifying protein- 
protein interactions between two specific proteins using the two-hybrid technique is commercially 
available from Clontech. This system can also be extended to map protein domains involved in specific 
5 protein interactions as well as to pinpoint amino acid residues that are crucial for these interactions. 

In order to fmd compounds that interfere with the interaction of a gene identified herein and other 
intra- or extracellular components can be tested, a reaction mixture is usually prepared contauiing the 
product of the gene and the intra- or extracellular component under conditions and for a time allowing for 
the interaction and binding of the two products. To test the ability of a test compound to inhibit binding, 
10 the reaction is run in the absence and in the presence of the test compound. In addition, a placebo may be 
added to a third reaction mixture, to serve as positive control. The binding (complex formation) between 
the test compound and the intra- or extracellular con^onent present in the mixture is monitored as 
described above. The formation of a conq)lex in the control reaction(s) but not in the reaction mixture 
containing the test compoxmd indicates that the test compound interferes with the interaction of the test 
15 compound and its reaction partner. 

K. Compositions and Methods for the Treatment of Immune Related Diseases 
The compositions useM in the treatment of immune related diseases include, without limitation, 
proteins, antibodies, small organic molecules, peptides, phosphopeptides, antisense and ribozyme 
molecules, triple helix molecules, etc. that uahibit or stimulate immune function, for example, monocyte 
20 proliferation/activation, lymphokine release, or immune cell infiltration. 

For example, antisense RNA and RNA molecules act to directly block the translation of mRNA 
by hybridizing to targeted mRNA and preventing protein translation. When antisense DNA is used, 
oligodeoxyribonucleotides derived from the translation initiation site, e.g., between about -10 and +10 
positions of the target gene nucleotide sequence, are preferred. 
25 Ribozymes are enzymatic RNA molecules capable of catalyzing the specific cleavage of RNA. 

Ribozymes act by sequence-specific hybridization to the complementary target RNA, followed by 
endonucleolytic cleavage. Specific ribozyme cleavage sites within a potential RNA target can be identified 
by known techniques. For further details see, e.g., Rossi, Current Biology 4, A€^'A1\ (1994), and PCT 
publication No. WO 97/33551 (published September 18, 1997). 
30 Nucleic acid molecules in triple helix formation used to inhibit transcription should be single- 

stranded and composed of deoxynucleotides. The base composition of these oligonucleotides is designed 
such that it promotes triple helix formation via Hoogsteen base pairing rules, which generally require 
sizeable stretches of purines or pyrimidines on one strand of a duplex. For further details see, e.g., PCT 
publication No. WO 97/33551, supra. 
35 These molecules can be identified by any or any combination of the screening assays discussed 

above and/or by any other screening techniques well known for those skilled in the art. 
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L. Anti-PRQ Antibodies 

The present invention further provides anti-PRO antibodies. Exemplary antibodies include 
polyclonal, monoclonal, humanized, bispecific, and heteroconjugate antibodies, 

L Polyclonal Antibodies 

The anti-PRO antibodies may comprise polyclonal antibodies. Methods of preparing polyclonal 
antibodies are known to the skilled artisan. Polyclonal antibodies can be raised in a mammal, for 
example, by one or more injections of an immunizing agent and, if desired, an adjuvant. Typically, die 
immunizing agent and/or adjuvant will be injected in the mammal by multiple subcutaneous or 
intraperitoneal injections. The immunizing agent may include the PRO polypeptide or a fusion protein 
thereof. It may be useful to conjugate the immimizing agent to a protein known to be immunogenic in the 
mammal being immunized. Examples of such immunogenic proteins include but are not limited to 
keyhole limpet hemocyanin, serum albumin, bovine thyroglobulin, and soybean trypsin inhibitor. 
Examples of adjuvants which may be employed include Freund's complete adjuvant and MPL-TDM 
adjuvant (monophosphoryl Lipid A, synthetic trehalose dicorynomycolate). The immunization protocol 
may be selected by one sldlled in the art without undue experimentation. 
2. Monoclonal Antibodies 

The anti-PRO antibodies may, alternatively, be monoclonal antibodies. Monoclonal antibodies 
may be prepared using hybridoma methods, such as those described by Kohler and Milstein, Nature, 
256: 495 (1975). In a hybridoma method, a mouse, hamster, or other appropriate host animal, is typically 
inoutnunized with an innmiunizing agent to elicit lymphocytes that produce or are capable of producing 
antibodies that will specifically bind to the immunizing agent. Alternatively, the lymphocytes may be 
inunimized in vitro. 

The immunizing agent will typically include the PRO polypeptide or a fusion protein thereof. 
Generally, either peripheral blood lymphocytes ("PBLs") are used if cells of human origin are desired, or 
spleen cells or lymph node cells are used if non-human mammalian sources are desired. The lymphoc5^es 
are then fused with an immortalized cell line using a suitable fusiug agent, such as polyethylene glycol, to 
form a hybridoma cell [Coding, Monoclonal Antibodies: Principles and Practice. Academic Press, (1986) 
pp. 59-103], Immortalized cell lines are usually transformed mammalian cells, particularly myeloma cells 
of rodent, bovine and human origin. Usually, rat or mouse myeloma cell lines are employed. The 
hybridoma cells may be cultured in a suitable culture medium that preferably contains one or more 
substances that inhibit the growth or survival of the unfused, inunortalized cells. For example, if the 
parental cells lack the enzyme hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the 
culture medium for the hybridomas typically will include hypoxanthine, aminopterin, and thynaidine 
("HAT medium"), which substances prevent the growth of HGPRT-deficient cells. 

Preferred immortalized cell lines are those that fuse efficiently, support stable high level 
expression of antibody by the selected antibody-producing cells, and are sensitive to a medium such as 
HAT medium. More preferred inunortalized cell lines are murine myeloma lines, which can be obtained. 
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for instance, jBrom the Salk Institute Cell Distribution Center, San Diego, California and the American 
Type Culture Collection, Manassas, Virginia. Human myeloma and mouse-human heteromyeloma cell 
lines also have been described for the production of human monoclonal antibodies [Kozbor, J. Immunol.. 
133 :3001 (1984); Brodeur et al,. Monoclonal Antibody Production Techniques and Applications. Marcel 
5 Dekker, Inc., New York, (1987) pp. 51-63]. 

The culture medium in which the hybridoma cells are cultured can then be assayed for the 
presence of monoclonal antibodies directed against PRO. Preferably, the binding specificity of 
monoclonal antibodies produced by the hybridoma cells is determined by immimoprecipitation or by an in 
vitro binding assay, such as radioinoonaunoassay (RIA) or enzyme-linked immunoabsorbent assay (ELISA). 
10 Such techniques and assays are known in the art. The binding affinity of the monoclonal antibody can, 
for example, be determined by the Scatchard analysis of Munson and Pollard, Anal. Biochem., 107: 220 
(1980). 

After the desired hybridoma cells are identified, the clones may be subcloned by limiting dilution 
procedures and grown by standard methods [Coding, supral . Suitable culture media for this purpose 

15 include, for example, Dulbecco's Modified Eagle's Medium and RPMI-1640 medium. Alternatively, the 
hybridoma cells may be grown in vivo as ascites in a mammal. 

The monoclonal antibodies secreted by the subclones may be isolated or purified from die culture 
medium or ascites fluid by conventional immimoglobulin purification procedures such as, for example, 
protein A-Sepharose, hydroxylapatite chromatography, gel electrophoresis, dialysis, or affinity 

20 chromatography. 

The monoclonal antibodies may also be made by recombinant DNA methods, such as those 
described in U.S. Patent No. 4,816,567. DNA encoding the monoclonal antibodies of the invention can 
be readily isolated and sequenced using conventional procedures (e.g., by using oligonucleotide probes 
that are capable of binding specifically to genes encoding the heavy and light chains of murine antibodies). 

25 The hybridoma cells of the invention serve as a preferred source of such DNA. Once isolated, the DNA 
may be placed into expression vectors, which are then transfected into host cells such as simian COS cells, 
Chinese hamster ovary (CHO) cells, or myeloma cells that do not otherwise produce immunoglobulin 
protein, to obtain the synthesis of monoclonal antibodies in the recombinant host cells. The DNA also 
may be modified, for example, by substituting the coding sequence for human heavy and light chain 

30 constant domains in place of the homologous murine sequences [U.S. Patent No. 4,816,567; Morrison et 
al., supral or by covalently joining to the inmaunoglobulin coding sequence all or part of the coding 
sequence for a non-inamunoglobulin polypeptide. Such a non-inununoglobulin pol3/peptide can be 
substituted for the constant domains of an antibody of the invention, or can be substituted for the variable 
domains of one antigen-combining site of an antibody of the invention to create a chimeric bivalent 

35 antibody. 

The antibodies may be monovalent antibodies. Methods for preparing monovalent antibodies are 
well known in the art. For example, one method involves recombinant expression of immunoglobulin 
light chain and modified heavy chain. The heavy chain is truncated generally at any point in the Fc region 
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so as to prevent heavy chain crosslinkmg. Alternatively, the relevant cysteine residues are substituted with 
another amino acid residue or are deleted so as to prevent crosslinking. 

In vitro methods are also suitable for preparing monovalent antibodies. Digestion of antibodies to 
produce fragments thereof, particularly. Fab fragments, can be accomplished using routine techniques 
5 known in tlie art. 

3. Human and Humanized Antibodies 

The anti-PRO antibodies of the invention may further comprise humanized antibodies or hirnian 
antibodies. Humanized forms of non-hxmian (e.g., murine) antibodies are chimeric inmiunoglobulins, 
immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', F(ab')2 or other antigen-binding 
10 subsequences of antibodies) which contain minimal sequence derived from non-human immunoglobulin, 
Himianized antibodies include human inamunoglobulins (recipient antibody) in which residues from a 
complementary deternaiiiing region (CDR) of the recipient are replaced by residues from a CDR of a non- 
human species (donor antibody) such as mouse, rat or rabbit having the desired specificity, affinity and 
capacity. In some instances, Fv framework residues of the human incununoglobulin are replaced by 
15 corresponding non-human residues. Humanized antibodies may also comprise residues which are foimd 
neither in the recipient antibody nor in the imported CDR or framework sequences. In general, the 
humanized antibody will comprise substantially all of at least one, and typically two, variable domains, in 
which all or substantially all of the CDR regions correspond to those of a non-human inomunoglobulin and 
all or substantially all of the FR regions are those of a human immunoglobulin consensus sequence. The 
20 humanized antibody optimally also will comprise at least a portion of an immunoglobulin constant region 
(Fc), typically that of a hinnan immunoglobulin [Jones et al.. Nature, 321:522-525 (1986); Riechmann et 
al.. Nature, 332:323-329 (1988); and Presta, Curr. Op. Struct. Biol.. 2:593-596 (1992)]. 

Methods for humanizing non-himiaa antibodies are well known in the art. Generally, a 
hmnanized antibody has one or more amino acid residues introduced into it from a source which is non- 
25 hmnan. These non-human amino acid residues are often referred to as "import" residues, which are 

typically taken from an "import" variable domain. Humanization can be essentially performed following 
the method of Winter and co-workers [Jones et al.. Nature, 321:522-525 (1986); Riechmann et al.. 
Nature, 332:323-327 (1988); Verhoeyen et al.. Science, 239:1534-1536 (1988)], by substituting rodent 
CDRs or CDR sequences for the corresponding sequences of a human antibody. Accordingly, such 
30 "humanized" antibodies are chimeric antibodies (U.S. Patent No. 4,816,567), wherein substantially less 
than an intact human variable domain has been substituted by the corresponding sequence from a non- 
human species. In practice, humanized antibodies are typically hmnan antibodies in which some CDR 
residues and possibly some FR residues are substituted by residues from analogous sites in rodent 
antibodies. 

35 Human antibodies can also be produced using various techniques known in the art, including 

phage display libraries [Hoogenboom and Wmter, J. Mol. Biol.. 227:381 (1991); Marks et al., J. Mol. 
Biol., 222: 581 (1991)]. The techniques of Cole et al. and Boerner et al. are also available for the 
preparation of human monoclonal antibodies (Cole et al.. Monoclonal Antibodies and Cancer Therapv, 
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Alaa R. Liss, p. 77 (1985) and Boeraer et al., J. Immunol.. 14701:86-95 (1991)]. Similarly, human 
antibodies can be made by introducing of human immimoglobulin loci into transgenic animals, e.g., mice 
in which the endogenous immunoglobulin genes have been partially or completely inactivated. Upon 
challenge, human antibody production is observed, which closely resembles that seen in humans in all 
respects, including gene rearrangement, assembly, and antibody repertoire. This approach is described, 
for example, in U.S. Patent Nos. 5,545,807; 5,545,806; 5,569,825; 5,625,126; 5,633,425; 5,661,016, 
and in the following scientific publications: Marks et al, Bio/Technology 10, 779-783 (1992); Lonberg et 
al.. Nature 368 856-859 (1994); Morrison, Nature 368, 812-13 (1994); Fishwild etal, Nature 
Biotechnology 14, 845-51 (1996); Neuberger, Nature Biotechnology 14. 826 (1996); Lonberg and Huszar, 
Intem. Rev. Immunol. 13 65-93 (1995). 

The antibodies may also be affinity matured using known selection and/or mutagenesis methods 
as described above. Preferred affinity matured antibodies have an affinity which is five times, more 
preferably 10 times, even more preferably 20 or 30 times greater than the starting antibody (generally 
murine, humanized or hxmian) from which the matured antibody is prepared. 

4. Bispecific Antibodies 
Bispecific antibodies are monoclonal, preferably human or humanized, antibodies that have 
binding specificities for at least two different antigens. In the present case, one of the binding specificities 
is for the PRO, the other one is for any other antigen, and preferably for a cell-surface protein or receptor 
or receptor subimit. 

Methods for making bispecific antibodies are known in the art. Traditionally, the recombinant 
production of bispecific antibodies is based on the co-expression of two immunoglobulin heavy- 
chain/light-chain pairs, where the two heavy chains have different specificities [Milstem and Cuello, 
Nature. 305:537-539 (1983)]. Because of the random assortment of immunoglobulin heavy and light 
chains, these hybridomas (quadromas) produce a potential mixture of ten different antibody molecules, of 
which only one has the correct bispecific structure. The purification of the correct molecule is usually 
accomplished by affinity chromatography steps. Similar procedures are disclosed in WO 93/08829, 
published 13 May 1993, and in Traunecker et al., EMBQ J., 10:3655-3659 (1991). 

Antibody variable domains with the desired binding specificities (antibody-antigen combining 
sites) can be fused to immunoglobulin constant domain sequences. The fusion preferably is with an 
immunoglobulin heavy-chain constant domain, comprising at least part of the hinge, CH2, and CH3 
regions. It is preferred to have the first heavy-chain constant region (CHI) containing the site necessary 
for light-chain binding present in at least one of the fusions. DNAs encoding the immunoglobulin heavy- 
chain fusions and, if desired, the immunoglobulin light chain, are inserted into separate expression 
vectors, and are co-transfected into a suitable host organism. For further details of generating bispecific 
antibodies see, for example, Suresh et al.. Methods in Enzymology. 121:210 (1986). 

According to another approach described in WO 96/27011, the interface between a pair of 
antibody molecules can be engineered to maximize the percentage of heterodimers which are recovered 



88 



wo 2004/041170 



PCT/US2003/034312 



from recombinant cell culture. The preferred interface comprises at least a part of the CHS region of an 
antibody constant domain. In this method, one or more small amino acid side chains from the interface of 
the first antibody molecule are replaced with larger side chains (e.g. tyrosiae or tryptophan). 
Compensatory "cavities" of identical or similar size to the large side chaiQ(s) are created on the interface 
5 of the second antibody molecule by replacing large amino acid side chains with smaller ones (e.g. alaniae 
or threonine). This provides a mechanism for increasing the yield of the heterodimer over other imwanted 
end-products such as homodimers. 

Bispecific antibodies can be prepared as full length antibodies or antibody fragments (e.g. F(ab')2 
bispecific antibodies). Techniques for generating bispecific antibodies from antibody fragments have been 

10 described in the literature. For example, bispecific antibodies can be prepared can be prepared using 

chemical linkage. Brennan et al.^ Science 229:81 (1985) describe a procedxure wherein intact antibodies 
are proteolytically cleaved to generate F(ab')2 fragments. These fragments are reduced in the presence of 
the dithiol complexing agent sodium arsenite to stabilize vicinal dithiols and prevent intermolecular 
disulfide formation. The Fab' fragments generated are then converted to thionitrobenzoate (TNB) 

15 derivatives. One of the Fab'-TNB derivatives is then reconverted to the Fab'-thiol by reduction with 

mercaptoethylamine and is mixed with an equimolar amount of the other Fab'-TNB derivative to form the 
bispecific antibody. The bispecific antibodies produced can be used as agents for the selective 
immobilization of enzymes. 

Fab' fragments may be directly recovered from E, coli and chemically coupled to form bispecific 

20 antibodies. Shalaby et al , J. Exp. Med. 175:217-225 (1992) describe the production of a fully humanized 
bispecific antibody F(ab')2 molecule. Each Fab' fragment was separately secreted from E, coli and 
subjected to directed chemical coupling in vitro to form the bispecific antibody. The bispecific antibody 
thus formed was able to bind to cells overexpressing the ErbB2 receptor and normal human T cells, as 
well as trigger the lytic activity of human C3^otoxic lymphocytes against human breast tumor targets. 

25 Various technique for making and isolating bispecific antibody fragments directly from 

recombinant cell culture have also been described. For example, bispecific antibodies have been produced 
using leucine zippers. Kostelny et al. , J. Immunol. 148(5): 1547-1553 (1992). The leucine zipper 
peptides from the Fos and Jun proteins were linked to the Fab' portions of two different antibodies by 
gene ftision. The antibody homodimers were reduced at the hinge region to form monomers and then 

30 re-oxidized to form the antibody heterodimers. This method can also be utilized for die production of 
antibody homodimers. The "diabody" technology described by HoUinger et al. , Proc. Natl. Acad. Sci. 
USA 90:6444-6448 (1993) has provided an alternative mechanism for making bispecific antibody 
fragments. The fragments comprise a heavy-chain variable domain (Vh) connected to a light-chain 
variable domain (VJ by a linker which is too short to allow pairing between the two domains on the same 

35 chain. Accordingly, the Vh and Vl domains of one fragment are forced to pair with the complementary 
Vl and Vh domains of another fragment, thereby forming two antigen-binding sites. Another strategy for 
making bispecific antibody fragments by the use of single-chain Fv (sFv) dimers has also been reported. 
See, Gruber et al, J. Inamunol. 152:5368 (1994). 



89 



wo 2004/041170 



PCT/US2003/034312 



Antibodies with more than two valencies are contemplated. For example, trispecific antibodies can be 
prepared. Tutt et al. , J. Immunol. 147:60 (1991). 

Exemplary bispecific antibodies may bind to two different epitopes on a given PRO pol3^eptide 
herein. Alternatively, an anti-PRO polypeptide arm may be combined with an arm which binds to a 
5 triggering molecule on a leukocyte such as a T-cell receptor molecule (e.g. CD2, CDS, CD28, or B7), or 
Fc receptors for IgG (FcyR), such as FcyRI (CD64), FcyRII (CD32) and FcyRIII (CD16) so as to focus 
cellular defense mechanisms to the cell expressing the particular PRO pol)rpeptide. Bispecific antibodies 
may also be used to localize cytotoxic agents to cells which express a particular PRO polypeptide. These 
antibodies possess a PRO-binding arm and an arm which binds a cytotoxic agent or a radionuclide 
10 chelator, such as EOTUBE, DPTA, DOTA, or TETA. Another bispecific antibody of interest binds the 
PRO polypeptide and further binds tissue factor (TF). 

5. Heteroconjugate Antibodies 

Heteroconjugate antibodies are also within the scope of the present invention. Heteroconjugate 
antibodies are composed of two covalently joined antibodies. Such antibodies have, for example, been 

15 proposed to target immune system cells to unwanted cells [U.S. Patent No. 4,676,980], and for treatment 
of HIV infection [WO 91/00360; WO 92/200373; EP 03089]. It is contemplated that the antibodies may 
be prepared in vitro using known methods in synthetic protein chemistry, including those involving 
crosslinking agents. For example, immimotoxins may be constructed using a disulfide exchange reaction 
or by forming a thioether bond. Examples of suitable reagents for this purpose include iminothiolate and 

20 methyl-4-mercaptobutyrimidate and those disclosed, for example, in U.S. Patent No. 4,676,980. 

6. Effector Function Engineering 

It may be desirable to modify the antibod}' of the invention with respect to effector function, so as 
to enhance, e.g,, the effectiveness of the antibody in treating cancer. For example, cysteine residue(s) 

25 may be introduced into the Fc region, thereby allowing interchain disulfide bond formation in this region. 
The homodimeric antibody thus generated may have improved internalization capability and/or increased 
complement-mediated cell killing and antibody-dependent cellular cytotoxicity (ADCC). See Caron et al. , 
J. Exp Med. . 176: 1191-1195 (1992) and Shopes, J. Immunol. . 148: 2918-2922 (1992). Homodimeric 
antibodies with enhanced anti-tumor activity may also be prepared using heterobifunctional cross-linkers as 

30 described in Wolff et al Cancer Research, 53: 2560-2565 (1993). Alternatively, an antibody can be 
engineered that has dual Fc regions and may thereby have enhanced complement lysis and ADCC 
capabilities. See Stevenson et al., Anti-Cancer Drug Design. 3: 219-230 (1989). 

7. Inomaimoconjugates 

The invention also pertains to immunoconjugates comprising an antibody conjugated to a 
35 cytotoxic agent such as a chemotherapeutic agent, toxin (e.g, , an enzymatically active toxin of bacterial, 
fungal, plant, or animal origin, or fragments thereof), or a radioactive isotope (i.e., a radioconjugate). 

Chemotherapeutic agents useful in the generation of such irmnunoconjugates have been described 
above. Enzymatically active toxins and fragments thereof that can be used include diphtheria A chain. 
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nonbinding active fragments of diphtheria toxin, exotoxin A chain (from Pseudoinonas aeruginosa), ricin 
A chain, abrin A chain, modeccin A chain, alpha-sarcin, Aleurites fordii proteins, dianthin proteins, 
Phytolaca omericana proteins (PAPI, PAPII, and PAP-S), momordica charantia inhibitor, curcin, crotin, 
sapaonaria officinalis inhibitor, gelonin, mitogellin, restrictocin, phenomycin, enomycin, and the 
5 tricothecenes. A variety of radionuclides are available for the production of radioconjugated antibodies. 
Examples include ^'^Bi, '^'I, '^^In, ^°Y, and '^^Re. 

Conjugates of the antibody and cytotoxic agent are made using a variety of bifunctional protein- 
coupling agents such as N-succinimidyl-3-(2-pyridyldithiol) propionate (SPDP), iminothiolane (IT), 
bifunctional derivatives of imidoesters (such as dimethyl adipimidate HCL), active esters (such as 

10 disuccinimidyl suberate), aldehydes (such as glutareldehyde), bis-azido compounds (such as bis (p- 
azidobenzoyl) hexanediamine), bis-diazonium derivatives (such as bis-(p-diazoniumbenzoyl)- 
ethylenediamine), diisocyanates (such as tolyene 2,6-diisocyanate), and bis-active fluorine compoimds 
(such as l,5-difluoro-2,4-dimtrobenzene). For example, a ricin immunotoxin can be prepared as 
described in Vitetta et al,^ Science, 238: 1098 (1987). Carbon- 14-labeled l-isothiocyanatobenzyl-3- 

15 methyldiethylene triaminepentaacetic acid (MX-DTPA) is an exemplary chelating agent for conjugation of 
radionucleotide to the antibody. See WO94/11026. 

In another embodiment, the antibody may be conjugated to a "receptor" (such streptavidin) for 
utilization in tumor pretargeting wherein the antibody-receptor conjugate is administered to the patient, 
followed by removal of mboimd conjugate from the circulation using a clearing agent and tibien 

20 administration of a "ligand" (e.g., avidin) that is conjugated to a cytotoxic agent (e.g., a radionucleotide). 
8. Immunolipo somes 

The antibodies disclosed herein may also be formulated as inununoliposomes. Liposomes 
containing the antibody are prepared by methods known in the art, such as described in Epstein et aL, 
Proc. NatL Acad. Sci. USA, 82: 3688 (1985); Hwang etal, Proc. Natl Acad. Sci. USA. 77: 4030 

25 (1980); and U.S. Pat. Nos. 4,485,045 and 4,544,545. Liposomes with enhanced circulation time are 
disclosed in U.S. Patent No. 5,013,556. 

Particularly useful liposomes can be generated by the reverse-phase evaporation method 
with a lipid composition comprising phosphatidylcholine, cholesterol, and PEG-derivatized 
phosphatidylethanolamine (PEG-PE). Liposomes are extruded through filters of dejSned pore size to jdeld 

30 liposomes with the desired diameter. Fab' fragments of the antibody of the present invention can be 
conjugated to the liposomes as described in Martin et al J. Biol. Chem., 257: 286-288 (1982) via a 
disulfide-interchange reaction, A chemotherapeutic agent (such as Doxorubicin) is optionally contained 
within the liposome. See Gabizon et aL , J. National Cancer Inst., 81(19): 1484 (1989). 



35 M. Pharmaceutical Compositions 

The active PRO molecules of the invention (e.g., PRO polypeptides, anti-PRO antibodies, and/or 
variants of each) as well as other molecules identified by the screening assays disclosed above, can be 
administered for the treatment of immime related diseases, in the form of pharmaceutical compositions. 
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Therapeutic formulations of the active PRO molecule, preferably a polypeptide or antibody of the 
invention, are prepared for storage by mixing the active molecule having the desured degree of purity with 
optional pharmaceutically acceptable carriers, excipients or stabilizers {Remington* s PMrmaceutical 
Sciences 16th edition, Osol, A. Ed. [1980]), in the form of lyophilized formulations or aqueous solutions. 
5 Acceptable carriers, excipients, or stabilizers are nontoxic to recipients at the dosages and concentrations 
employed, and include buffers such as phosphate, citrate, and other organic acids; antioxidants including 
ascorbic acid and methionine; preservatives (such as octadecyldimethylbenzyl ammonium cliloride; 
hexamethonium chloride; benzalkonimn cliloride, benzedionium chloride; phenol, butyl or benzyl alcohol; 
alkyl parabens such as methyl or propyl paraben; catechol; resorcinol; cyclohexanol; S-pentanol; and m- 

10 cresol); low molecular weight (less than about 10 residues) polypeptides; proteins, such as serum albumin, 
gelatin, or immunoglobulins; hydrophilic polymers such as polyvinylpyrrolidone; amino acids such as 
glycine, glutamine, asparagme, histidine, arginine, or lysine; monosaccharides, disaccharides, and other 
carbohydrates including glucose, mannose, or dextrins; chelating agents such as EDTA; sugars such as 
sucrose, mannitol, trehalose or sorbitol; salt-forming counter-ions such as sodium; metal complexes {e.g., 

15 Zn-protem complexes); and/or non-ionic surfactants such as TWEEN™, PLURONICS"^^ or polyethylene 
glycol (PEG). 

Compounds identified by the screening assays disclosed herein can be formulated in an analogous 
manner, using standard techniques well known in the art. 

Lipofections or liposomes can also be used to deliver the PRO molecule into ceUs. Where 

20 antibody fragments are used, the smallest inhibitory fragment which specifically binds to the binding 
domam of the target protein is preferred. For example, based upon the variable region sequences of an 
antibody, peptide molecules can be designed which retain the abilit}^ to bind the target protein sequence. 
Such peptides can be synthesized chemically and/or produced by recombinant DNA technology (see, e.g., 
Marasco et aL, Proc. Natl. Acad. Sci. USA 90, 7889-7893 [1993]). 

25 The formulation herein may also contain more than one active compoimd as necessary for the 

particular indication being treated, preferably those with complementary activities that do not adversely 
affect each other. Alternatively, or in addition, the composition may comprise a cytotoxic agent, c3^okine 
or growth inhibitory agent. Such molecules are suitably present in combination in amoimts that are 
effective for the purpose intended. 

30 The active PRO molecules may also be entrapped in microcapsules prepared, for example, by 

coacervation techniques or by mterfacial polymerization, for example, hydroxymethylcellulose or gelatin- 
microcapsules and poly-(methyhnethacylate) microcapsules, respectively, in colloidal drug delivery 
systems (for example, liposomes, albumin microspheres, microemulsions, nano-particles and 
nanocapsules) or in macroemulsions. Such techniques are disclosed in Remington's Pharmaceutical 

35 Sciences 16th edition, Osol, A. Ed. (1980). 

The formulations to be used for in vivo administration must be sterile. This is readily 
accomplished by filtration through sterile filtration membranes. 

92 



wo 2004/041170 



PCT/US2003/034312 



Sustained-release preparations or the PRO molecules may be prepared. Suitable examples of 
sustained-release preparations include semipermeable matrices of solid hydrophobic polymers containing 
the antibody, which matrices are in the form of shaped articles, e.g., films, or microcapsules. Examples 
of sustained-release matrices include polyesters, hydrogels (for example, poly(2-hydroxyethyl- 
5 methacrylate), or poly(vinylalcohol)), polylactides (U.S. Pat. No. 3,773,919), copolymers of L-glutamic 
acid and y-ethyl-L-glutamate, non-degradable ethylene-vinyl acetate, degradable lactic acid-glycolic acid 
copolymers such as the LUPRON DEPOT*^^ (injectable microspheres composed of lactic acid-glycolic 
acid copol3aner and leuprolide acetate), and poly-D-(-)-3-hydroxybutyric acid. While polymers such as 
ethylene-vinyl acetate and lactic acid-glycolic acid enable release of molecules for over 100 days, certain 

10 hydrogels release proteins for shorter time periods. When encapsulated antibodies remain in the body for 
a long time, they may denature or aggregate as a result of exposure to moisture at 37°C, resulting in a loss 
of biological activity and possible changes in immunogenicity. Rational strategies can be devised for 
stabilization depending on the mechanism involved. For example, if the aggregation mechanism is 
discovered to be intermolecular S-S bond formation through thio-disulfide interchange, stabilization may 

15 be achieved by modifying sulfliydryl residues, lyophilizing from acidic solutions, controlling moisture 
content, using appropriate additives, and developing specific polymer matrix compositions. 
N. Methods of Treatment 

It is contemplated that the polypeptides, antibodies and other active compounds of the present 
invention may be used to treat various immune related diseases and conditions, such as 

20 monoc5^e/macrophage diseases, iucluding those characterized by iofiltration of inflammatory cells into a 
tissue, stimulation of monocyte/macrophages, inlnbition of monocytes/macrophages, increased or 
decreased vascular permeability or the inhibition thereof. 

Exemplary conditions or disorders to be treated with the polypeptides, antibodies and other 
compounds of the invention, include, but are not limited to systemic lupus erythematosis, rheumatoid 

25 arthritis, juvenile chronic arthritis, osteoarthritis, spondyloarthropathies, systemic sclerosis (scleroderma), 
idiopathic inflammatory myopathies (dermatomyositis, polymyositis), Sjogren's syndrome, systemic 
vasculitis, sarcoidosis, autoimmune hemolj^ic anemia (immune pancytopenia, paroxysmal nocturnal 
hemoglobinuria), autoimmime thrombocytopenia (idiopadiic thromboc54openic pxupura, immune-mediated 
thrombocytopenia), thyroiditis (Grave's disease, Hashimoto's thyroiditis, juvenile Ijonphocytic thyroiditis, 

30 atrophic thyroiditis), diabetes mellitus, immune-mediated renal disease (glomerulonephritis, 
tubulointerstitial nephritis), demyelinating diseases of the central and peripheral nervous systems such as 
multiple sclerosis, idiopathic demyelinating polyneuropathy or Guillain-Barre syndrome, and chronic 
inflammatory demyelinating polyneuropathy, hepatobiliary diseases such as infectious hepatitis (hepatitis 
A, B, C, D, E and other non-hepatotropic viruses), autoimmime chronic active hepatitis, primary biliary 

35 cirrhosis, granulomatous hepatitis, and sclerosing cholangitis, inflanmiatory bowel disease (ulcerative 
colitis: Crohn's disease), gluten-sensitive enteropathy, and Whipple's disease, autoimmune or immune- 
mediated skin diseases including bullous skin diseases, erythema multiforme and contact dermatitis, 
psoriasis, allergic diseases such as asthma, allergic rhinitis, atopic dermatitis, food hypersensitivity and 
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urticaria, immunologic diseases of ttie lung such as eosinophilic pneumonias, idiopathic pulmonary fibrosis 
and hypersensitivity pneumonitis, transplantation associated diseases includmg graft rejection and graft - 
versus-host-disease . 

Rheumatoid arthritis (RA) is a chronic systemic autoimmune inflammatory disease that mainly 
5 involves the synovial membrane of multiple joints with resultant injury to the articular cartilage. The 
pathogenesis is T lymphocyte dependent and is associated with the production of rheumatoid factors, auto- 
antibodies directed against self IgG, with the resultant formation of immxme complexes that attain high 
levels in joint fluid and blood. These complexes in the joint may induce the marked infiltrate of 
lymphocytes and monocytes/macrophages into the synovium and subsequent marked synovial changes; the 

10 joint space/fluid if infiltrated by similar cells with the addition of numerous neutrophils. Tissues affected 
are primarily the joints, often in symmetrical pattern. However, extra-articular disease also occurs in two 
major forms. One form is the development of extra-articular lesions with ongoing progressive joint 
disease and typical lesions of pulmonary fibrosis, vasculitis, and cutaneous ulcers. The second form of 
extra-articular disease is the so called Felty*s syndrome which occurs late m the RA disease course, 

15 sometimes after joint disease has become quiescent, and involves the presence of neutropenia, 
thrombocytopenia and splenomegaly. This can be accompanied by vasculitis in multiple organs with 
formations of infarcts, skin ulcers and gangrene. Patients often also develop rheumatoid nodules in the 
subcutis tissue overlying affected joints; the nodules late stage have necrotic centers surrounded by a 
mixed inflammatory cell infiltrate. Other manifestations which can occur in RA include: pericarditis, 

20 pleuritis, coronary arteritis, intestitial pneumonitis with pulmonary fibrosis, keratoconjunctivitis sicca, and 
rheumatoid nodules. The number and activation state of macrophages in the inflamed synovius correlates 
with the significance of RA (Kinne et aL, 2000 Arthritis Res. 2: 189-202). As described above, 
macrophages are not believed to be involved in the early events of RA, but monocytes/macrophages have 
tissue destructive and tissue remodeling properties which may contribute to both acute and chronic RA. 

25 Juvenile chronic arthritis is a chronic idiopathic inflammatory disease which begins often at less 

than 16 years of age. Its phenotype has some similarities to RA; some patients which are rhematoid factor 
positive are classified as juvenile rheumatoid arthritis. The disease is sub-classified into three major 
categories: pauciarticular, polyarticular, and systemic. The arthritis can be severe and is typically 
destructive and leads to joint ankylosis and retarded growth. Other manifestations can include chronic 

30 anterior uveitis and systemic amyloidosis. 

Spondyloarthropathies are a group of disorders with some common clinical features and the 
common association with the expression of HLA-B27 gene product. The disorders include: ankylosing 
sponylitis, Reiter*s syndrome (reactive arthritis), arthritis associated with inflammatory bowel disease, 
spondylitis associated with psoriasis, juvenile onset spondyloarthropathy and undifferentiated 

35 spondyloarthropathy. Distinguishing features include sacroileitis with or without spondylitis; 
inflammatory asymmetric arthritis; association with HLA-B27 (a serologically defined allele of the HLA-B 
locus of class I MHC); ocular inflammation, and absence of autoantibodies associated with other 
rheumatoid disease. It was shown that CD 163+ macrophages were increased in the synovial lining and 
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colonic mucosa in Spondyloarthropathy and correlates with the expression of HLA-DR and the production 
of TNF-alpha (Baeten et al., 2002 J Pathol 196(3):343-350). 

Systemic sclerosis (scleroderma) has an unknown etiology. A hallmark of the disease is 
induration of the skin; likely this is induced by an active inflammatory process. Scleroderma can be 
5 localized or systemic; vascular lesions are common and endothelial cell injury in the microvasculature is 
an early and important event in the development of systemic sclerosis; the vascular injury may be immime 
mediated. An immimologic basis is implied by the presence of mononuclear cell infiltrates in the 
cutaneous lesions and the presence of anti-nuclear antibodies in many patients. ICAM-1 is often 
upregulated on the cell surface of fibroblasts in skin lesions suggesting that T cell interaction with these 

10 cells may have a role in the patliogenesis of tlae disease. As well as T cells, monocj^es/macrophages are 
proposed to play a role in the progression of scleroderma by secreting fibrogenic cytokines (Yamamoto et 
al., 2001 J Dermatol Sci 26(2): 133-139). Other organs involved include: the gastrointestinal tract: 
smooth muscle atrophy and fibrosis resulting in abnormal peristalsis/motility; kidney: concentric 
subendothelial intimal proliferation affecting small arcuate and interlobular arteries with resultant reduced 

15 renal cortical blood flow, results in proteinuria, azotenwa and hypertension; skeletal muscle: atrophy, 
interstitial fibrosis; inflammation; lung: interstitial pneumonitis and interstitial fibrosis; and heart: 
contraction band necrosis, scarring/fibrosis. 

Idiopathic inflanamatory myopathies including dermatomyositis, polymyositis and others are 
disorders of chronic muscle inflammation of unknown etiology resulting in muscle weakness. Muscle 

20 injury/inflammation is often symmetric and progressive. Autoantibodies are associated with most forms. 
These myositis-specific autoantibodies are directed against and inhibit the function of components, proteins 
and RNA's, involved in protein synthesis. 

Sjogren's syndrome is due to immune-mediated inflammation and subsequent functional 
destruction of the tear glands and salivary glands. The disease can be associated with or accompanied by 

25 inflammatory coimective tissue diseases. The disease is associated with autoantibody production against 
Ro and La antigens, both of which are small RNA-protein complexes. Lesions result in 
keratoconjimctivitis sicca, xerostomia, with other manifestations or associations including bilary cirrhosis, 
peripheral or sensory neuropathy, and palpable purpura. 

Systemic vasculitis are diseases in which the primary lesion is inflaimnation and subsequent 

30 damage to blood vessels which results in ischemia/necrosis/degeneration to tissues supplied by the affected 
vessels and eventual end-organ dysfunction in some cases. Vasculitis can also occur as a secondary lesion 
or sequelae to other immune-inflammatory mediated diseases such as rheumatoid ardiritis, systemic 
sclerosis, etc. , particularly in diseases also associated with the formation of immune complexes. Diseases 
in the primary systemic vasculitis group include: systemic necrotizing vasculitis: polyarteritis nodosa, 

35 allergic angiitis and granulomatosis, polyangiitis; Wegener's granulomatosis; lymphomatoid 
granulomatosis; and giant cell arteritis. Miscellaneous vasculitides include: mucocutaneous lymph node 
syndrome (MLNS or Kawasaki's disease), isolated CNS vasculitis, Behet's disease, thromboangiitis 
obliterans (Buerger's disease) and cutaneous necrotizing venulitis. The pathogenic mechanism of most of 
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the types of vasculitis listed is believed to be primarily due to the deposition of immunoglobulin complexes 
in the vessel wall and subsequent induction of an inflammatory response either via ADCC, complement 
activation, or both. 

Sarcoidosis is a condition of unknown etiology which is characterized by the presence of 
epithelioid granulomas in nearly any tissue in the body; involvement of tlie lung is most common. The 
pathogenesis involves the persistence of activated macrophages and lymphoid cells at sites of the disease 
with subsequent clironic sequelae resultant from the release of locally and systemically active products 
released by these cell types. 

Autoimmxme hemolytic anemia including autoimmune hemolytic anemia, immune pancytopenia, 
and paroxysmal noctural hemoglobinuria is a result of production of antibodies that react with antigens 
expressed on the surface of red blood cells (and in some cases other blood cells including platelets as well) 
and is a reflection of the removal of those antibody coated cells via complement mediated lysis and/or 
ADCC/Fc-receptor-mediated mechanisms. 

Thyroiditis including Grave's disease, Hashimoto's thyroiditis, juvenile lymphocytic thyroiditis, 
and atrophic th3n:oiditis, are the result of an autoimmune response against thyroid antigens with production 
of antibodies that react with proteins present in and often specific for the thyroid gland. Experunental 
models exist including spontaneous models: rats (BUF and BB rats) and chickens (obese chicken strain); 
inducible models: immunization of animals with either thyroglobulin, thyroid microsomal antigen (thyroid 
peroxidase). 

Inflammatory and Fibrotic Lung Disease, including Eosinophilic Pneumonias; IdiopatMc 
Pulmonary Fibrosis, and Hypersensitivity Pneumonitis may involve a disregulated inunune-inflammatory 
response. Inhibition of that response would be of tlierapeutic benefit. 

Psoriasis is a T lymphocyte-mediated inflammatory disease. Lesions contain infiltrates of T 
lymphocytes, macrophages and antigen processing cells, and some neutrophils. 

Other diseases in which intervention of the immune and/or inflammatory response have benefit 
are infectious disease including but not limited to viral infection (including but not limited to AIDS, 
hepatitis A, B, C, D, E and herpes) bacterial infection, fungal infections, and protozoal and parasitic 
infections. Molecules (or derivatives/agonists) which stimulate the immune reaction can be utilized 
therapeutically to enhance the immune response to infectious agents), diseases of immunodeficiency 
(molecules/derivatives/agonists) which stimulate the inmiune reaction can be utilized therapeutically to 
enhance the immune response for conditions of inherited, acquired, infectious induced (as in HIV 
infection), or iatrogenic (i.e., as from chemotherapy) immunodeficiency, and neoplasia. 

It has been demonstrated that some human cancer patients develop an antibody and/or 
monocyte/macrophage response to antigens on neoplastic cells. It has also been shown in animal models 
of neoplasia that enhancement of the kmnime response can result in rejection or regression of that 
particular neoplasm. Molecules that enhance the monocyte/macrophage response have utility in vivo in 
enhancing the immune response against neoplasia. Molecules which enhance the monocyte/macrophage 
proliferative response (or small molecule agonists or antibodies that affected tlie same receptor in an 
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agonistic fashion) can be used therapeutically to treat cancer. Molecules that inhibit the 
monoc3^e/niacrophage response also function in vivo during neoplasia to suppress tihie immune response to 
a neoplasm; such molecules can either be expressed by the neoplastic cells themselves or their expression 
can be induced by the neoplasm in other cells. Antagonism of such inhibitory molecules (either with 
5 antibody, small molecule antagonists or other means) enhances immune-mediated tumor rejection. 

Additionally, inhibition of molecules with proinflammatory properties may have therapeutic 
benefit in reperfusion injury; stroke; myocardial infarction; atherosclerosis; acute lung injury; 
hemorrhagic shock; bum; sepsis/septic shock; acute tubular necrosis; endometriosis; degenerative joint 
disease and pancreatis. 

10 The compounds of the present invention, e.g., polypeptides or antibodies, are administered to a 

mammal, preferably a hiraian, in accord with known methods, such as intravenous administration as a 
bolus or by continuous infusion over a period of time, by intramuscular, intraperitoneal, 
intracerobrospinal, subcutaneous, intra-articular, intrasynovial, intrathecal, oral, topical, or inhalation 
(intranasal, intrapulmonary) routes. Intravenous or inhaled administration of polypeptides and antibodies 

15 is preferred. 

In immunoadjuvant Iherapy, other therapeutic regimens, such administration of an anti-cancer 
agent, may be combined with the administration of the proteins, antibodies or compounds of the instant 
invention. For example, the patient to be treated with a the immunoadjuvant of the invention may also 
receive an anti-cancer agent (chemolherapeutic agent) or radiation therapy. Preparation and dosing 

20 schedules for such chemotherapeutic agents may be used according to manufacturers* rastructions or as 
determined empirically by the skilled practitioner. Preparation and dosing schedules for such 
chemotherapy are also described m Chemotherapy Service Ed., M.C, Perry, Williams & Wilkins, 
Baltimore, MD (1992). The chemotherapeutic agent may precede, or follow administration of the 
immunoadjuvant or may be given simultaneously therewith. Additionally, an anti-estrogen compound 

25 such as tamoxifen or an anti-progesterone such as onapristone (see, EP 616812) may be given in dosages 
known for such molecules. 

It may be desirable to also administer antibodies against other immune disease associated or 
tumor associated antigens, such as antibodies which bind to CD20, CDlla, CD18, ErbB2, EGFR, ErbB3, 
ErbB4, or vascular endothelial factor (VEGF), Alternatively, or in addition, two or more antibodies 

30 binding the same or two or more different antigens disclosed herein may be coadministered to the patient. 
Sometimes, it may be beneficial to also administer one or more cytokines to the patient. In one 
embodiment, the PRO polypeptides are coadministered with a growfli inhibitory agent. For example, the 
growfli inhibitory agent may be administered first, followed by a PRO polypeptide. However, 
simultaneous administration or administration first is also contemplated- Suitable dosages for the growth 

35 inhibitory agent are those presently used and may be lowered due to the combined action (synergy) of the 
growth inhibitory agent and the PRO polypeptide. 

For the treatment or reduction in the severity of immune related disease, the appropriate dosage 
of an a compoimd of the invention will depend on the type of disease to be treated, as defined above, the 
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severity and course of the disease, whether tlie agent is administered for preventive or therapeutic 
purposes, previous therapy, the patient's clinical history and response to the compound, and the discretion 
of the attending physician. The compoimd is suitably administered to the patient at one time or over a 
series of treatments. 

5 For example, depending on the type and severity of the disease, about 1 \xg/kg to 15 mg/kg (e.g., 

0.1-20 mg/kg) of polypeptide or antibody is an initial candidate dosage for administration to the patient, 
whether, for example, by one or more separate administrations, or by continuous infusion. A typical daily 
dosage might range from about 1 |Lig/kg to 100 mg/kg or more, depending on the factors mentioned above. 
For repeated administrations over several days or longer, depending on the condition, the treatment is 
10 sustained until a desired suppression of disease symptoms occurs. However, other dosage regimens may 
be useful. The progress of this therapy is easily monitored by conventional techniques and assays. 
O. Articles of Manufacture 

In another embodiment of the invention, an article of manufacture containing materials (e.g., 
comprising a PRO molecule) useful for the diagnosis or treatment of the disorders described above is 

15 provided. The article of manufacture comprises a container and an instruction. Suitable containers 
include, for example, bottles, vials, syringes, and test tubes. The containers may be formed firom a 
variety of materials such as glass or plastic. The container holds a composition which is effective for 
diagnosing or treating the condition and may have a sterile access port (for example the container may be 
an intravenous solution bag or a vial having a stopper pierceable by a hypodermic injection needle). The 

20 active agent in the composition is usually a polypeptide or an antibody of the invention. An instruction or 
label on, or associated with, the container indicates that the composition is used for diagnosing or treating 
the condition of choice. The article of manufacture may further comprise a second container comprising a 
pharmaceutically-acceptable buffer, such as phosphate-buffered saline. Ringer's solution and dextrose 
solution. It may farther include other materials desirable from a commercial and user standpoint, 

25 including other buffers, diluents, filters, needles, syringes, and package inserts with instructions for use. 
P. Diagnosis and Prognosis of Immune Related Disease 

Cell surface proteins, such as proteins which are overexpressed in certain immune related 
diseases, are excellent targets for drug candidates or disease treatment. The same proteins along with 
secreted proteins encoded by the genes amplified in immune related disease states find additional use in the 
30 diagnosis and prognosis of these diseases. For example, antibodies directed against the protein products of 
genes amplified in multiple sclerosis, rheumatoid arthritis, or another immune related disease, can be used 
as diagnostics or prognostics. 

For example, antibodies, including antibody fragments, can be used to qualitatively or 
quantitatively detect the expression of proteins encoded by amplified or overexpressed genes ("marker 
35 gene products"). The antibody preferably is equipped with a detectable, e.g., fluorescent label, and 
binding can be monitored by light microscopy, flow cytometry, fluorimetry, or otlier techniques known in 
the art. These techniques are particularly suitable, if the overejipressed gene encodes a cell surface protein 
Such binding assays are performed essentially as described above. 
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In situ detection of antibody binding to the marker gene products can be performed, for example, 
by inununofluorescence or immunoelectron microscopy. For this purpose, a histological specimen is 
removed from the patient, and a labeled antibody is applied to it, preferably by overlaying the antibody on 
a biological sample. This procedure also allows for determining the distribution of the marker gene 
product in the tissue examined. It will be apparent for those skilled in the art that a wide variety of 
histological methods are readily available for in situ detection. 

The following examples are offered for illustrative purposes only, and are not intended to limit 
the scope of the present invention in any way. 

All patent and literature references cited in the present specification are hereby incorporated by 
reference in their entirety. 

EXAMPLES 

Commercially available reagents referred to in the examples were used according to 
manufacturer's instructions unless otherwise mdicated. The source of those cells identified in the 
following examples, and throughout the specification, by ATCC accession numbers is the American Type 
Culture Collection, Manassas, VA. 

EXAMPLE 1: Microarrav analysis of monocyte/macrophages. 

Nucleic acid microarrays, often containing thousands of gene sequences, are usefiil for identifying 
differentially expressed genes in diseased tissues as compared to their normal counterparts. Using nucleic 
acid microarrays, test and control mRNA samples from test and control tissue samples are reverse 
transcribed and labeled to generate cDNA probes. The cDNA probes are then hybridized to an array of 
nucleic acids immobilized on a solid support. The array is configured such that the sequence and position 
of each member of the array is known. For example, a selection of genes known to be expressed in 
certain disease states may be arrayed on a solid support. Hybridization of a labeled probe with a 
particular array member indicates that the sample firom which the probe was derived expresses that gene. 
If the hybridization signal of a probe from a test (in this instance, differentiated macrophages) sample is 
greater than hybridization signal of a probe fi:om a control (in this instance, non-differentiated monocytes) 
sample, the gene or genes e>qpressed m the test tissue are identified. The implication of this result is that 
an overexpressed protein in a test tissue is useful not only as a diagnostic marker for the presence of the 
disease condition, but also as a therapeutic target for treatment of the disease condition. 

The methodology of hybridization of nucleic acids and microarray technology is well known in 
the art. In one example, the specific preparation of nucleic acids for hybridization and probes, slides, and 
hybridization conditions are all detailed in PCT Patent Application Serial No. PCT/USOl/10482, filed on 
March 30, 2001 and which is herein incorporated by reference. 

In this experiment, CD 14+ monocytes are selected by positive selection according to Miltenyi 
MACS™ protocol. Lymphocytes in 100 ml heparinized blood are separated using Ficoll Paque™. Cells 
are washed twice in PBS/0.5% BSA/2 mM EDTA. In final wash, all gradients are pooled and volume is 
brought to approximately 10 ml. The cells are centrifuged, the supernatant is removed and the cell pellet 
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is resuspended in buffer in a total volume of 10e7 cells per 80 jxl buffer. Add 20 |uil CD14 microbeads per 
lOe? total cells, mix and incubate 15 minutes at 6-12 C. Wash the cells by adding 20x labeling volume of 
buffer, spin pellet and resuspend in 500 ul buffer per lOeS cells. Separate cells with MACS™ depletion 
column type D and check purity of cells by labeling with anti-CD45 and anti-CD14 antibodies (cell purity 
at this point is > 95%). Lyse cells in RNA lysis buffer to obtain a timepoint of Day 0 monocytes, then 
plate remaining cells in 6 well plates in macrophage differentiation medium: DMEM 4,5 ug/ml glucose, 
Pen-Strep, L-glutamine, 20% FBS and 10% Human AB serum (Gemini, Cat # 100-512). Seed cells at 1,5 
X 10e6 per well (6 well Costar cell culture plates) and grow at 37 C, 7% C02. After 24 hours in culture, 
the cells were harvested and lysed in RNA lysis buffer to obtain mRNA for the Day 1 timepoint. The 
remaining cells were kept in culture and until Day 7. After 7 days in culture, the cells were lysed in 
RNA lysis buffer to obtain Day 7 timepoint at which time the cells displayed gross macrophage 
morphology. 

The mRNA was isolated by Qiagen miniprep and analysis run on Affimax™ (Affymetrix Lac. Santa 
Clara, CA) microarray chips and proprietary Genentech microarrays. The cells harvested at Day 0 
timepoint, the Day 1 timepoint, and the Day 7 timepoint were subjected to the same analysis. Genes were 
compared whose expression was upregulated at Day 7 as compared to Day 0 and Day 1 . 

Below are the results of these experiments, demonstrating that various PRO polypeptides of the 
present invention are differentially expressed in differentiated macrophages at Day 7 as compared to non- 
differentiated monocytes at Day 0 and at Day 1. As described above, these data demonstrate that the PRO 
polypeptides of the present invention are useful not only as diagnostic markers for the presence of one or 
more i mm une disorders, but also serve as therapeutic targets for the treatment of those immune disorders. 
Specifically, the cDNAs shown Figures 592, Figure 708, Figure 724, Figure 888, Figure 1095, Figure 
1109, Figure 1456 and Figure 2331 are significantly overexpressed in differentiated macrophages as 
compared to non-differentiated monocytes at Day 0 and Day 1. 

The Figures 1-2517 show the nucleic acids of the invention and their encoded PRO polypeptides 
that are differentially expressed in differentiated macrophages at Day 7 as compared to non-differentiated 
monocytes at Day 0 and at Day 1 . 

EXAMPLE 2: Use of PRO as a hvbridization probe 

The following method describes use of a nucleotide sequence encoding PRO as a hybridization 

probe. 

DNA comprising the coding sequence of full-length or mature PRO as disclosed herein is 
employed as a probe to screen for homologous DNAs (such as those encoding naturally-occurring variants 
of PRO) in hiunan tissue cDNA libraries or human tissue genomic libraries. 

Hybridization and washing of filters containing either library DNAs is performed imder the 

following high stringency conditions. Hybridization of radiolabeled PRO-derived probe to the filters is 

performed in a solution of 50% formamide, 5x SSC, 0.1% SDS, 0.1% sodium pyrophosphate, 50 mM 

sodium phosphate, pH 6.8, 2x Denhardfs solution, and 10% dextran sulfate at 42°C for 20 hours. 

Washing of the filters is performed in an aqueous solution of 0. Ix SSC and 0. 1 % SDS at 42°C. 

100 
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DNAs having a desired sequence identity with the DNA encoding full-length native sequence 
PRO can then be identified using standard techniques known in the art. 

EXAMPLE 3: Expression of PRO in E. coli 
5 Tliis example illustrates preparation of an unglycosylated form of PRO by recombinant expression 

mE, coli. 

The DNA sequence encoding PRO is initially amplified using selected PGR primers. The 
primers should contain restriction enzyme sites which correspond to the restriction enzyme sites on the 
selected expression vector. A variety of expression vectors may be employed. An example of a suitable 

10 vector is pBR322 (derived from E. coli; see Bolivar et al.,. Gene, 2:95 (1977)) which contains genes for 
ampicillin and tetracycline resistance. The vector is digested with restriction enzyme and 
dephosphorylated. The PGR amplified sequences are then ligated into the vector. The vector will 
preferably include sequences which encode for an antibiotic resistance gene, a trp promoter, a polyhis 
leader (including the first six STII codons, polyhis sequence, and enterokinase cleavage site), the PRO 

15 coding region, lambda transcriptional termmator, and an argU gene. 

The ligation mixture is then used to transform a selected E, coli strain using the methods 
described in Sambrook et al., supra. Transformants are identified by their abDity to grow on LB plates 
and antibiotic resistant colonies are then selected. Plasmid DNA can be isolated and confirmed by 
restriction analysis and DNA sequencing. 

20 Selected clones can be grown overnight in liquid culture medium such as LB broth supplemented 

with antibiotics. The overnight culture may subsequently be used to inoculate a larger scale culture. The 
cells are then grown to a desired optical density, during which the expression promoter is turned on. 

After culturing the cells for several more hours, the cells can be harvested by centrifugation. The 
cell pellet obtained by the centrifugation can be soiubilized using various agents known in the art, and the 

25 soiubilized PRO protein can then be purified using a metal chelating colmim under conditions that allow 
tight binding of the protein. 

PRO may be expressed in E, coli in a poly-His tagged form, usmg the following procedure. The 
DNA encoding PRO is initially amplified using selected PGR primers. The primers will contain 
restriction enzyme sites which correspond to the restriction enzyme sites on the selected expression vector, 

30 and other useftd sequences providmg for efficient and reliable translation initiation, rapid purification on a 
metal chelation column, and proteolytic removal with enterokinase. The PCR-amplified, poly-His tagged 
sequences are then ligated into an expression vector, which is used to transform an E. coli host based on 
strain 52 (W3110 fuhA(tonA) Ion galE rpoHts(htpRts) clpP(lacIq). Transformants are first grown in LB 
containing 50 mg/ml carbenicillin at 30°G with shaking until an O.D.600 of 3-5 is reached. Gultures are 

35 then diluted 50-100 fold into GRAP media (prepared by mixing 3.57 g (NH4)2S04, 0.71 g sodium 

citrate*2H20, 1.07 g KGl, 5 .36 g Difco yeast extract, 5,36 g Sheffield hycase SF in 500 mL water, as 
well as 110 mM MPOS, pH 7.3, 0.55% (w/v) glucose and 7 mM MgS04) and grown for approximately 
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20-30 hours at 30°C with shaking. Samples are removed to verify expression by SDS-PAGE analysis, and 
the bulk culture is centrifuged to pellet the cells. Cell pellets are frozen until purification and refolding. 

E. coll paste from 0.5 to 1 L fermentations (6-10 g pellets) is resuspended in 10 volimies (w/v) in 
7 M guanidme, 20 mM Tris, pH 8 buffer. Solid sodium sulfite and sodium tetrathionate is added to make 
final concentrations of O.IM and 0.02 M, respectively, and the solution is stirred overnight at 4°C. This 
step results in a denatured protein with all cysteine residues blocked by sulfitolization. The solution is 
centrifuged at 40,000 rpm in a Beckman Ultracentifage for 30 min. The supernatant is diluted with 3-5 
volumes of metal chelate column buffer (6 M guanidine, 20 mM Tris, pH 7.4) and filtered through 0.22 
micron jSlters to clarify. The clarified extract is loaded onto a 5 ml Qiagen Ni-NTA metal chelate column 
equilibrated in the metal chelate column buffer. The column is washed with additional buffer containing 
50 mM imidazole (Calbiochem, Utrol grade), pH 7.4. The protein is eluted with buffer containing 250 
mM imidazole. Fractions containing the desired protein are pooled and stored at 4^C. Rotein 
concentration is estimated by its absorbance at 280 nm using the calculated extinction coefficient based on 
its amino acid sequence. 

The proteins are refolded by diluting the sample slowly into freshly prepared refolding buffer 
consisting of: 20 mM Tris, pH 8.6, 0.3 M NaCl, 2.5 M urea, 5 mM cysteine, 20 mM glycine and 1 mM 
EDTA. Refolding volumes are chosen so that the final protein concentration is between 50 to 100 
micrograms/ml. The refolding solution is stirred gently at 4*'C for 12-36 hours. The refolding reaction is 
quenched by the addition of TFA to a final concentration of 0.4% (pH of approximately 3). Before 
further purification of the protein, the solution is filtered through a 0.22 micron filter and acetonitrile is 
added to 2-10% final concentration. The refolded protein is chromatographed on a Poros RI/H reversed 
phase column using a mobile buffer of 0. 1 % TFA with elution with a gradient of acetonitrile from 10 to 
80%. Aliquots of fractions witbi A280 absorbance are analyzed on SDS polyacrylamide gels and fractions 
containing homogeneous refolded protein are pooled. Generally, the properly refolded species of most 
proteins are eluted at the lowest concentrations of acetonitrile since those species are the most compact 
with their hydrophobic interiors shielded from interaction with the reversed phase resin. Aggregated 
species are usually eluted at higher acetonitrile concentrations. In addition to resolving misfolded forms of 
proteins from flie desired form, the reversed phase step also removes endotoxin from the samples. 

Fractions containing Ihe desired folded PRO polypeptide are pooled and the acetonitrile removed 
using a gentle stream of nitrogen directed at the solution. Proteins are formulated into 20 mM Hepes, pH 
6.8 with 0. 14 M sodium chloride and 4% mannitol by dialysis or by gel filtration using G25 Superfine 
(Pharmacia) resins equilibrated in the formulation buffer and sterile filtered. 

Many of the PRO polypeptides disclosed herein were successfully expressed as described above. 

EXAMPLE 4: Expression of PRO in mammalian cells 

This example illustrates preparation of a potentially glycosylated form of PRO by recombinant 
expression in mammalian cells. 
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The vector, pRK5 (see EP 307,247, published March 15, 1989), is employed as the expression 
vector. Optionally, the PRO DNA is ligated into pRK5 with selected restriction enzymes to allow 
msertion of the PRO DNA using ligation methods such as described m Sambrook et aL, supra. The 
resulting vector is called pRK5-PRO. 
5 In one embodiment, the selected host cells may be 293 cells. Human 293 cells (ATCC CCL 

1573) are grown to confluence in tissue culture plates in medium such as DMEM supplemented with fetal 
calf serum and optionally, nutrient components and/or antibiotics. About 10 fxg pRK5-PRO DNA is 
mixed with about 1 fig DNA encoding the VA RNA gene [Thimmappaya et al,, Cell, 31:543 (1982)] and 
dissolved in 500 /xl of 1 mM Tris-HCl, 0.1 mM EDTA, 0.227 M CaClz. To this mixture is added, 

10 dropwise, 500 fil of 50 mM HEPES (pH 7.35), 280 mM NaCl, 1.5 mM NaP04, and a precipitate is 
allowed to form for 10 mmutes at 25°C. The precipitate is suspended and added to the 293 cells and 
allowed to settle for about four hours at 37°C. The culture medium is aspirated off and 2 ml of 20% 
glycerol in PBS is added for 30 seconds. The 293 cells are then washed with serum free mediimi, fresh 
medium is added and the cells are incubated for about 5 days. 

15 Approximately 24 hours after the transfections, the culture medium is removed and replaced with 

culture medium (alone) or culture medium containing 200 pCi/wl ^^S-cysteine and 200 jtiCi/ml ^^S- 
methionine. After a 12 hour incubation, the conditioned medium is collected, concentrated on a spin 
filter, and loaded onto a 15% SDS gel. The processed gel may be dried and exposed to film for a selected 
period of time to reveal the presence of PRO polypeptide. The cultures containing transfected cells may 

20 imdergo fiirtlier incubation (in serum free medium) and the medium is tested in selected bioassays. 

In an alternative technique, PRO may be introduced into 293 cells transiently using the dextran 
sulfate method described by Somparyrac et al., Proc. Natl. Acad. Sci., 12:7575 (1981). 293 cells are 
grown to maxunal density in a spinner flask and 700 fxg pRK5-PRO DNA is added. The cells are first 
concentrated from the spinner flask by centrifugation and washed with PBS. The DNA-dextran precipitate 

25 is incubated on the cell pellet for four hours. The cells are treated with 20% glycerol for 90 seconds, 
washed with tissue culture medium, and re-introduced into the spinner flask containing tissue culture 
medium, 5 fig/uol boviae insulin and 0.1 /tg/ml bovine transferrin. After about four days, the conditioned 
media is centrifiiged and filtered to remove cells and debris. The sample contaming expressed PRO can 
then be concentrated and purified by any selected method, such as dialysis and/or column 

30 chromatography. 

In another embodiment, PRO can be expressed in CHO cells. The pRK5-PRO can be transfected 
into CHO cells usmg known reagents such as CaP04 or DEAE-dextran. As described above, the cell 
cultures can be incubated, and the medium replaced with culture medium (alone) or medium containing a 
radiolabel such as ^^S-methionme. After determining the presence of PRO polypeptide, the culture 
35 medium may be replaced with serum free medium. Preferably, the cultures are incubated for about 6 

days, and then the conditioned medium is harvested. The medium containing the expressed PRO can then 
be concentrated and purified by any selected method. 
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Epitope-tagged PRO may also be expressed in host CHO cells. The PRO may be subcloned out 
of the pRK5 vector. The subclone insert can undergo PGR to fuse in frame with a selected epitope tag 
such as a poly-his tag into a Baculovirus expression vector. The poly-his tagged PRO insert can then be 
subcloned into a SV40 promoter/enhancer containing vector containing a selection marker such as DHFR 
5 for selection of stable clones. Finally, the CHO cells can be transfected (as described above) with the 
SV40 promoter/enhancer containing vector. Labeling may be performed, as described above, to verify 
expression. The culture medium containing the expressed poly-His tagged PRO can then be concentrated 
and purified by any selected method, such as by Ni^"^-chelate affmity chromatography. 

PRO may also be expressed in CHO and/or COS cells by a transient expression procedure or in 

10 CHO cells by another stable ejqpression procedure. 

Stable expression in CHO cells is performed using the following procedure. The proteins are 
expressed as an IgG construct (iinmunoadhesia), in which the coding sequences for the soluble forms (e.g. 
extracellular domains) of the respective proteins are fused to an IgGl constant region sequence containing 
the hinge, CH2 and CH2 domains and/or is a poly-His tagged form. 

15 Following PCR amplification, the respective DNAs are subcloned in a CHO expression vector 

using standard techniques as described in Ausubel et al.. Current Protocols of Molecular Biologv. Unit 
3, 16, John Wiley and Sons (1997). CHO expression vectors are constructed to have compatible restriction 
sites 5' and 3' of the DNA of interest to allow the convenient shuttling of cDNA's. The vector used 
expression in CHO cells is as described in Lucas et al., NucL Acids Res. 24:9 (1774-1779 (1996), and 

20 uses the SV40 early promoter/enhancer to drive expression of the cDNA of interest and dihydrofolate 

reductase (DHFR). DHFR expression permits selection for stable maintenance of the plasmid following 
transfection. 

Twelve micrograms of the desired plasmid DNA is introduced into approximately 10 million 
CHO cells using commercially available transfection reagents Superfect*^ (Quiagen), Dosper® or Fugene 

25 (Boehringer Mannheim). The cells are grown as described in Lucas et al. , suEra. Approximately 3 x 10"^ 
cells are frozen in an ampule for further growth and production as described below. 

The ampules containing the plasmid DNA are thawed by placement into water bath and mixed by 
vortexing. The contents are pipetted into a centrifuge tube containing 10 mL of media and centrifuged at 
1000 rpm for 5 minutes. The supernatant is aspirated and the cells are resuspended in 10 mL of selective 

30 media (0.2 |Lmi filtered PS20 with 5% 0.2 iiim diafiltered fetal bovine serum). The cells are then aliquoted 
into a 100 mL spinner containing 90 mL of selective media. After 1-2 days, the cells are transferred into 
a 250 mL spinner filled with 150 mL selective growth medium and incubated at 37°C. After another 2-3 
days, 250 mL, 500 mL and 2000 mL spinners are seeded with 3 x 10^ cells/mL. The cell media is 
exchanged with fresh media by centrifugation and resuspension in production medium. Although any 

35 suitable CHO media may be employed, a production medium described in U.S. Patent No. 5,122,469, 

issued June 16, 1992 may actually be used. A 3L production spiimer is seeded at 1.2 x 10^ cells/mL. On 
day 0, pH is determined. On day 1, the spinner is sampled and sparging with filtered air is commenced. 
On day 2, the spumer is sampled, the temperature shifted to 33°C, and 30 mL of 500 g/L glucose and 0.6 
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mL of 10% antifoam (e.g., 35% polydimethylsiloxane emulsion, Dow Coming 365 Medical Grade 
Emulsion) taken. Throughout the production, the pH is adjusted as necessary to keep it at around 7.2. 
After 10 days, or until the viability dropped below 70%, the cell culture is harvested by centrifugation and 
filtering through a 0.22 fxra filter. The filtrate was either stored at 4°C or immediately loaded onto 
5 columns for purification. 

For the poly-His tagged constructs, the proteins are purified using a Ni-NTA column (Qiagen). 
Before purification, imidazole is added to the conditioned media to a concentration of 5 mM. The 
conditioned media is pumped onto a 6 ml Ni-NTA column equilibrated in 20 mM Hepes, pH 7.4, buffer 
contaming 0.3 M NaCl and 5 mM imidazole at a flow rate of 4-5 ml/mm. at 4°C. After loading, the 

10 column is washed with additional equilibration buffer and the protein eluted with equilibration buffer 

containing 0.25 M imidazole. The highly purified protein is subsequently desalted into a storage buffer 
containing 10 mM Hepes, 0.14 M NaCl and 4% mannitol, pH 6.8, with a 25 ml G25 Superfine 
(Pharmacia) column and stored at -80°C. 

Immunoadhesin (Fc-containing) constructs are purified from the conditioned media as follows. 

15 The conditioned medium is pumped onto a 5 ml Protein A column (Pharmacia) which had been 

equilibrated in 20 mM Na phosphate buffer, pH 6.8. After loading, the column is washed extensively 
witii equilibration buffer before elution with 100 mM citric acid, pH 3,5. The eluted protem is 
immediately neutralized by collecting 1 ml fractions into tubes containing 275 pi of 1 M Tris buffer, pH 9. 
The highly purified protein is subsequently desalted into storage buffer as described above for the poly- 

20 His tagged proteins. The homogeneity is assessed by SDS polyacrylamide gels and by N-termiaal amino 
acid sequencing by Edman degradation. 

Many of the PRO polypeptides disclosed herein were successfully expressed as described above. 

EXAMPLE 5: Expression of PRO in Yeast 
25 The following method describes recombinant expression of PRO in yeast. 

First, yeast expression vectors are constructed for intracellular production or secretion of PRO 

from the ADH2/GAPDH promoter. DNA encoding PRO and the promoter is inserted into suitable 

restriction enzyme sites in the selected plasmid to direct intracellular e>q)ression of PRO. For secretion, 

DNA encoding PRO can be cloned into the selected plasmid, together with DNA encoding the 
30 ADH2/GAPDH promoter, a native PRO signal peptide or other mammalian signal peptide, or, for 

example, a yeast alpha-factor or invertase secretory signal/leader sequence, and linker sequences (if 

needed) for ejqpression of PRO. 

Yeast cells, such as yeast strain ABllO, can then be transformed with the expression plasmids 

described above and cultured in selected fermentation media. The transformed yeast supematants can be 
35 analyzed by precipitation with 10% trichloroacetic acid and separation by SDS-PAGE, followed by 

staining of the gels with Coomassie Blue stain. 
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Recombinant PRO can subsequently be isolated and purified by removing the yeast cells from the 
fermentation medium by centrifagation and then concentrating the medium using selected cartridge filters. 
The concentrate containing PRO may further be purified using selected column chromatography resins. 
Many of the PRO polypeptides disclosed herein were successfully expressed as described above. 

5 

EXAMPLE 6: Expression of PRO in Baculovirus-Infected Insect Cells 

The following method describes recombinant expression of PRO in Baculovirus-infected insect 

cells. 

The sequence coding for PRO is fused upstream of an epitope tag contained within a baculovirus 

10 ejqpression vector. Such epitope tags include poly-his tags and inamunoglobulin tags (like Fc regions of 
IgG). A variety of plasmids may be employed, including plasmids derived from conamercially available 
plasmids such as pVL1393 (Novagen). Briefly, the sequence encoding PRO or the desired portion of the 
coding sequence of PRO such as the sequence encoding the extracellular domain of a transmembrane 
protein or the sequence encoding the mature protein if the protein is extracellular is amplified by PGR with 

15 primers complementary to the 5* and 3' regions. The 5* primer may incorporate flanking (selected) 
restriction enzyme sites. The product is then digested with those selected restriction enzymes and 
subcloned into the expression vector. 

Recombinant baculovirus is generated by co-transfecting the above plasmid and BaculoGold^'^ 
virus DNA (Pharmingen) into Spodoptera frugiperda ("Sf9") cells (ATCC CRL 1711) using lipofectin 

20 (commercially available from GIBCO-BRL). After 4-5 days of incubation at 28°C, the released viruses 
are harvested and used for further amplifications. Viral infection and protein expression are performed as 
described by O'Reilley et al., Baculovirus expression vectors: A Laboratory Manual, Oxford: Oxford 
University Press (1994). 

Expressed poly-his tagged PRO can then be purified, for example, by Ni^^-chelate affinity 

25 chromatography as follows. Extracts are prepared from recombinant virus-infected Sf9 cells as described 
by Rupert et al.'. Nature, 362: 175-179 (1993). Briefly, Sf9 cells are washed, resuspended in sonication 
buffer (25 mL Hepes, pH 7.9; 12.5 mM MgClj; 0.1 mM EDTA; 10% glycerol; 0.1% NP-40; 0.4 M 
KCl), and sonicated twice for 20 seconds on ice. The sonicates are cleared by centrifugation, and the 
supernatant is diluted 50-fold in loading buffer (50 mM phosphate, 300 mM NaCl, 10% glycerol, pH 7.8) 

30 and filtered through a 0.45 |im filter. A Ni^*^-NTA agarose colimm (commercially available from Qiagen) 
is prepared with a bed volume of 5 mL, washed with 25 mL of water and equilibrated with 25 mL of 
loading buffer. The filtered cell extract is loaded onto the column at 0,5 mL per minute. The column is 
washed to baseline A280 with loadmg buffer, at which point fraction collection is started. Next, the column 
is washed with a secondary wash buffer (50 mM phosphate; 300 vcM NaCl, 10% glycerol, pH 6.0), which 

35 elutes nonspecifically bound protein. After reaching A280 baseline again, tlie column is developed witli a 0 
to 500 mM Imidazole gradient in the secondar}^ wash buffer. One mL fractions are collected and analyzed 
by SDS-PAGE and silver staining or Western blot with Ni^"**-NTA-conjugated to alkaline phosphatase 
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(Qiagen). Fractions containing the eluted His,o-tagged PRO are pooled and dialyzed against loading 
buffer. 

Alternatively, purification of the IgG tagged (or Fc tagged) PRO can be performed using known 
chromatography techniques, including for instance, Protein A or protein G column chromatography. 
5 Many of the PRO polypeptides disclosed herein were successfully expressed as described above. 

EXAMPLE 7: Preparation of Antibodies that Bind PRO 

This example illustrates preparation of monoclonal antibodies which can specifically bind PRO. 
Techniques for producing the monoclonal antibodies are known in the art and are described, for 
10 instance, in Coding, supra. Lnmunogens that may be employed include purified PRO, fusion proteins 
containing PRO, and cells expressing recombinant PRO on the cell surface. Selection of the immunogen 
can be made by the skilled artisan without imdue experimentation. 

Mice, such as Balb/c, are immunized with the PRO immunogen emulsified in complete Freund's 
adjuvant and injected subcutaneously or intraperitoneally in an amount from 1-100 micrograms. 
15 Alternatively, the immxmogen is emulsified in MPL-TDM adjuvant (Ribi Inamunochemical Research, 

Hamilton, MT) and injected into the animal's hind foot pads. The inomunized mice are then boosted 10 to 
12 days later with additional immunogen emulsified in the selected adjuvant. Thereafter, for several 
weeks, the mice may also be boosted with additional immunization injections. Serum samples may be 
periodically obtained from the mice by retro-orbital bleeding for testing in ELISA assays to detect anti- 
20 PRO antibodies. 

After a suitable antibody titer has been detected, the animals "positive" for antibodies can be 
injected with a final intravenous injection of PRO. Three to four days later, the mice are sacrificed and 
the spleen cells are harvested. The spleen cells are then fused (using 35 % polyethylene glycol) to a 
selected murine myeloma cell line such as P3X63AgU.l, available from ATCC, No. CRL 1597. The 

25 fusions generate hybridoma cells which can then be plated in 96 well tissue culture plates containing HAT 
(hypoxanfhine, aminopterin, and thymidine) medium to inhibit proliferation of non-fiised cells, myeloma 
hybrids, and spleen cell hybrids. 

The hybridoma cells will be screened in an ELISA for reactivity against PRO. Determination of 
"positive" hybridoma cells secreting the desired monoclonal antibodies against PRO is within the skill in 

30 the art. 

The positive hybridoma cells can be injected intraperitoneally into syngeneic Balb/c mice to 
produce ascites containing the anti-PRO monoclonal antibodies. Alternatively, the hybridoma cells can be 
grown in tissue culture flasks or roller bottles. Purification of the monoclonal antibodies produced in the 
ascites can be accomplished using ammonium sulfate precipitation, followed by gel exclusion 
35 chromatography. Alternatively, affinity chromatography based upon binding of antibody to protein A or 
protein G can be employed. 
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EXAMPLE 8: Purification of PRO Polypeptides Using Specific Antibodies 

Native or recombinant PRO polypeptides may be purified by a variety of standard techniques in 
the art of protein purification. For example, pro-PRO polypeptide, mature PRO pol3^eptide, or pre-PRO 
polypeptide is purified by immunoaffinity chromatography using antibodies specific for the PRO 
5 polypeptide of interest. In general, an immimoaffinity colunm is constructed by covalently coupling the 
anti-PRO polypeptide antibody to an activated chromatographic resin. 

Polyclonal immunoglobulins are prepared from immune sera either by precipitation with 
ammonium sulfate or by purification on immobilized Protein A (Pharmacia LKB Biotechnology, 
Piscataway, N J.). Likewise, monoclonal antibodies are prepared from mouse ascites fluid by ammonium 
10 sulfate precipitation or chromatography on immobilized Protein A. Partially purified immunoglobulin is 
covalentiy attached to a chromatographic resin such as CnBr-activated SEPHAROSE'^ (Pharmacia LKB 
Biotechnology). The antibody is coupled to the resin, the resin is blocked, and the derivative resin is 
washed according to the manufacturer's instructions. 

Such an immunoaffinity colunm is utilized in the purification of PRO polj^peptide by preparing a 
15 fraction from cells containing PRO polypeptide in a soluble form. This preparation is derived by 

solubilization of tibie whole cell or of a subcellular firaction obtauied via differential centrifugation by the 
addition of detergent or by other methods well known in the art. Alternatively, soluble PRO polypeptide 
containing a signal sequence may be secreted in useful quantity into the medium in which the cells are 
grown. 

20 A soluble PRO polypeptide-containing preparation is passed over the immunoaffinity column, and 

the colmnn is washed under conditions that allow the preferential absorbance of PRO polj^eptide (e.g. , 
high ionic strength buffers m the presence of detergent). Then, the colimin is eluted under conditions that 
disrupt antibody/PRO polypeptide binding (e,g,, a low pH buffer such as approximately pH 2-3, or a high 
concentration of a chaotrope such as urea or thiocyanate ion), and PRO polypeptide is collected. 

25 

EXAMPLE 9: Drug Screening 

This invention is particularly useful for screening compoxmds by using PRO polypeptides or 
binding fragment thereof in any of a variety of drug screening techniques. The PRO polypeptide or 
fragment employed in such a test may either be free in solution, affixed to a solid support, borne on a cell 

30 surface, or located intracellularly. One method of drug screening utilizes eukaryotic or prokaryotic host 
cells which are stably transformed with recombinant nucleic acids expressing the PRO polypeptide or 
fragment. Drugs are screened against such transformed cells in competitive binding assays. Such cells, 
either in viable or fixed form, can be used for standard binding assays. One may measure, for example, 
the formation of complexes between PRO polypeptide or a fragment and the agent being tested. 

35 Alternatively, one can examine the diminution in complex formation between the PRO polypeptide and its 
target cell or target receptors caused by the agent being tested. 

Thus, the present invention provides methods of screening for drugs or any other agents which 
can affect a PRO polypeptide-associated disease or disorder. These methods comprise contacting such an 
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agent with an PRO polypeptide or fragment thereof and assaying (I) for the presence of a complex 
between the agent and the PRO polypeptide or fragment, or (ii) for the presence of a complex between the 
PRO polypeptide or fragment and the cell, by methods well known in the art. In such competitive binding 
assays, the PRO polypeptide or fragment is typically labeled. After suitable incubation, free PRO 
5 polypeptide or fragment is separated from that present m bound form, and the amount of free or 

uncomplexed label is a measure of the ability of the particular agent to bind to PRO polypeptide or to 
interfere wifb the PRO polypeptide/cell complex. 

Another technique for drug screening provides high tliroughput screening for compounds having 
suitable binding affinity to a polypeptide and is described in detail in WO 84/03564, published on 
10 September 13, 1984. Briefly stated, large numbers of different small peptide test compounds are 
synthesized on a solid substrate, such as plastic pins or some other surface. As applied to a PRO 
polypeptide, the peptide test compounds are reacted with PRO polypeptide and washed. Bound PRO 
polypeptide is detected by methods well known in the art. Purified PRO polypeptide can also be coated 
directly onto plates for use in the aforementioned drug screening techniques. In addition, non-neutralizing 
15 antibodies can be used to capture the peptide and immobilize it on the solid support. 

This invention also contemplates the use of competitive drug screening assays in which 
neutralizing antibodies capable of binding PRO polypeptide specifically compete with a test compound for 
binding to PRO polypeptide or fragments thereof. In this manner, the antibodies can be used to detect the 
presence of any peptide which shares one or more antigenic determinants with PRO polypeptide. 

20 

EXAMPLE 10: Rational Drug Design 

The goal of rational drug design is to produce structural analogs of biologically active polypeptide 
of interest {i,e, , a PRO polypeptide) or of small molecules with which they interact, e,g, , agonists, 
antagonists, or inhibitors. Any of these examples can be used to fashion drugs which are more active or 

25 stable forms of the PRO polypeptide or which enliance or interfere with the function of the PRO 
polypeptide in vivo (c.f,, Hodgson, Bio/Technology, 9: 19-21 (1991)). 

In one approach, the three-dimensional structure of the PRO polypeptide, or of a PRO 
polypeptide-inhibitor complex, is determined by x-ray crystallography, by computer modeling or, most 
typically, by a combination of the two approaches. Both the shape and charges of the PRO polypeptide 

30 must be ascertained to elucidate the structmre and to determine active site(s) of the molecule. Less often, 
useful information regarding the structure of the PRO polypeptide may be gained by modeling based on 
the structure of homologous proteins. In both cases, relevant structural information is used to design 
analogous PRO polypeptide-like molecules or to identify efficient inhibitors. Useful examples of rational 
drug design may include molecules which have improved activity or stability as shown by Braxton and 

35 Wells, Biochemistry, 31:7796-7801 (1992) or which act as inhibitors, agonists, or antagonists of native 
peptides as shown by Athauda et al. , J. Biochem.. 113: 742-746 (1993). 

It is also possible to isolate a target-specific antibody, selected by functional assay, as described 
above, and then to solve its crystal structure. This approach, in principle, yields a pharmacore upon 
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which subsequent drug design can be based. It is possible to bypass protein crystallography altogether by 
generating anti-idiotypic antibodies (anti-ids) to a functional, pharmacologically active antibody. As a 
mirror image of a mirror image, the binding site of the anti-ids would be expected to be an analog of the 
original receptor. The anti-id could then be used to identify and isolate peptides from banks of chemically 
5 or biologically produced peptides. The isolated peptides would then act as the pharmacore. 

By virtue of the present invention, sufficient amounts of the PRO polypeptide may be made 
available to perform such anal3^ical studies as X-ray crystallography. In addition, knowledge of the PRO 
polypeptide amino acid sequence provided herein will provide guidance to those employing computer 
modeling techniques in place of or in addition to x-ray crystallography. 

10 The foregoing written specification is considered to be sufficient to enable one skilled in the art to 

practice the invention. The present invention is not to be limited in scope by the construct deposited, since 
the deposited embodiment is intended as a single illustration of certain aspects of the invention and any 
constructs that are functionally equivalent are within the scope of this invention. The deposit of material 
herein does not constitute an admission that the written description herein contained is inadequate to enable 

15 the practice of any aspect of the invention, including the best mode thereof, nor is it to be construed as 
limiting the scope of the claims to the specific illustrations that it represents. Indeed, various 
modifications of the invention in addition to those shown and described herein will become apparent to 
those skilled in the art firom the foregoiug description and fall within the scope of the appended claims. 
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What is claimed: 

1. Isolated nucleic acid having at least 80% nucleic acid sequence identity to a nucleotide 
sequence identity to: 

(a) tlie nucleotide sequence shown in any one of the Figures 1-2517 (SEQ ID NOS: 1- 

2517); or 

(b) the nucleotide sequence encoding the polypeptide shown in any one of the Figures 
1-2517 (SEQ ID NOS: 1-2517). 

2. A vector comprising the nucleic acid of Claim 1. 

3. The vector of Claim 2 operably linked to control sequences recognized by a host cell 
transformed with die vector. 

4. A host cell comprising the vector of Claim 2. 

5. The host cell of Claim 4, wherein said cell is a CHO cell, an E.coli cell or a yeast cell. 

6. A process for producing a PRO polypeptide comprising culturing the host cell of Claim 5 
under conditions suitable for expression of said PRO polypeptide and recovering said PRO polypeptide 
from the cell culture. 

7. An isolated polypeptide having at least 80% amino acid sequence identity to: 

(a) a polypeptide shown in any one of Figures 1-2517 (SEQ ID NOS: 1-2517); or 

(b) a polypeptide encoded by the full length coding region of the nucleotide sequence 
shown in any one of Figures 1-2517 (SEQ ID NOS: 1-2517). 

8. A chimeric molecule comprisiug a polypeptide according to Claim 7 fused to a 
heterologous amino acid sequence. 

9. The chimeric molecule of Claim 8, wherein said heterologous amino acid sequence is an 
epitope tag sequence or an Fc region of an immunoglobulin. 

10. An antibody which specifically binds to a polypeptide according to Claim 7. 

11. The antibody of Claim 10, wherein said antibody is a monoclonal antibody, a humanized 
antibody or a single-chain antibody. 
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12. A composition of matter comprising (a) a polypeptide of Claim 7, (b) an agonist of said 
polypeptide, (c) an antagonist of said polypeptide, or (d) an antibody that binds to said polypeptide, in 
combination with a carrier. 

5 

13. The composition of matter of Claim 12, wherein said carrier is a pharmaceutically 
acceptable carrier. 

14. The composition of matter of Claim 13 comprising a therapeutically effective amoimt of 
10 (a), (b), (c) or (d). 

15. An article of manufacture, comprising: 
a container; 

a label on said container; and 
15 a composition of matter comprising (a) a polypeptide of Claim 7, (b) an agonist of said 

polypeptide, (c) an antagonist of said polypeptide, or (d) an antibody that binds to said polypeptide, 
contained within said container, wherein label on said container indicates that said composition of matter 
can be used for treating an inamune related disease. 

20 16. A method of treating an immune related disorder in a mammal in need thereof 

comprisiQg administering to said mammal a therapeutically effective amount of (a) a polypeptide of Claim 
7, (b) an agonist of said polypeptide, (c) an antagonist of said polypeptide, or (d) an antibody that binds to 
said polypeptide. 

25 17. The method of Claim 16, wherein the immune related disorder is systemic lupus 

erythematosis, rheumatoid arthritis, osteoarthritis, juvenile chronic arthritis, a spondyloarthropathy, 
systemic sclerosis, an idiopathic inflammatory myopathy, Sjogren's syndrome, systemic vasculitis, 
sarcoidosis, autoimmune hemol3rtic anemia, autoimmune thrombocytopenia, thyroiditis, diabetes mellitus, 
immune-mediated renal disease, a demyelinating disease of the central or peripheral nervous system, 

30 idiopathic demyelinating polyneuropathy, Guillain-Barre syndrome, a chronic inflammatDry demyelinating 
polyneuropathy, a hepatobiliary disease, infectious or autounmune chronic active hepatitis, primary biliary 
cirrhosis, granulomatous hepatitis, sclerosing cholangitis, inflammatory bowel disease, gluten-sensitive 
enteropathy, Whipple's disease, an autoimmune or immune-mediated skin disease, a bullous skin disease, 
erythema multiforme, contact dermatitis, psoriasis, an allergic disease, asthma, allergic rhinitis, atopic 

35 dermatitis, food hypersensitivity, urticaria, an immunologic disease of the lung, eosinophilic pneumonias, 
idiopathic pulmonary fibrosis, hypersensitivity pneumonitis, a transplantation associated disease, graft 
rejection or graft-versus-host-disease. 
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18. A metJiod for detennining the presence of a PRO polypeptide of the invention as 
described in Figures 1-2517 (SEQ ID NOS: 1-2517), in a sample suspected of containing said polypeptide, 
said method comprising exposing said sample to an anti-PRO antibody, where the and determming binding 
of said antibody to a component of said sample. 

5 

19. A method of diagnosing an immune related disease in a mammal, said method 
comprising detecting the level of expression of a gene encoding a PRO polypeptide of the invention as 
described in Figures 1-2517 (SEQ ID NOS: 1-2517), (a) in a test sample of tissue cells obtaiaed from the 
mammal, and (b) in a control sample of known normal tissue cells of the same cell type, wherein a higher 

10 or lower level of expression of said gene in the test sample as compared to the control sample is indicative 
of the presence of an immime related disease in the mammal from which the test tissue cells were 
obtained. 

20. A method of diagnosing an immime related disease in a mammal, said method 
15 comprising (a) contacting a PRO polypeptide of the invention as described in Figures 1-2517 (SEQ ID 

NOS: 1-2517), anti-PRO antibody with a test sample of tissue cells obtained from said mammal and (b) 
detecting the formation of a complex between the antibody and the polypeptide in the test sample, wherein 
formation of said complex is indicative of the presence of an immune related disease in the mammal from 
which the test tissue cells were obtained. 

20 

21. A method of identifying a compound that inhibits the activity of a PRO polypeptide of 
the invention as described in Figures 1-2517 (SEQ ID NOS: 1-2517), said method comprising contacting 
cells which normally respond to said polypeptide with (a) said polypeptide and (b) a candidate compound, 

25 and determining the lack responsiveness by said cell to (a). 

22. A method of identifying a compoimd that inhibits the ejq)ression of a gene encoding a 
PRO polypeptide of the invention as described in Figures 1-2517 (SEQ ID NOS: 1-2517), said method 
comprising contacting cells which normally e^qpress said polj^eptide with a candidate compound, and 

30 determining the lack of expression said gene. 

23. The method of Claim 22, wherein said candidate compound is an antisense nucleic acid. 

24. A method of identifying a compound that mimics the activity of a PRO polypeptide of 
35 the invention as described in any one of Figures 1-2517 (SEQ ID NOS: 1-2517), said method comprising 

contacting cells which normally respond to said polypeptide with a candidate compound, and determining 
the responsiveness by said cell to said candidate compound. 
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25. A method of stiinulating the immune response in a mammal, said method comprising 
administering to said mammal an effective amount of a PRO polypeptide of the invention as described in 
any one of Figures 1-2517 (SEQ ID NOS: 1-2517), antagonist, wherein said immune response is 
stimulated. 

26. A method of diagnosing an inflanamatory immune response in a mammal, said method 
comprising detecting the level of expression of a gene encoding a PRO polypeptide of the invention as 
described in any one of Figures 1-2517 (SEQ ID NOS: 1-2517), (a) in a test sample of tissue cells 
obtained from the mammal, and (b) in a control sample of known normal tissue cells of the same cell type, 
wherein a higher or lower level of expression of said gene in the test sample as compared to the control 
sample is indicative of the presence of an inflammatory immune response in the mammal from which the 
test tissue cells were obtained. 

27. A method of differentiating monocytes comprising; 

(a) isolating a population of monocj^es; 

(b) contacting the monocj^es with an effective amoimt of a PRO polypeptide of the 
invention as described in any of of Figures 1-2517 (SEQ ID NOS: 1-2517); and 

(c) determining the differentiation of said monoc3^es to said PRO polypeptide. 
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FIGURE 1 

CATCCGGTGTGGTCGACGGGTCCTCCAAGAGTTTGGGGCGCGGACCGGAGTACCTTGCGTGCAGTTATGTCGGCG 
TCGGTAGTGTCTGTCATTTCGCGGTTCTTAGAAGAGTACTTGAGCTCCACTCCGCAGCGTCTGAAGTTGCTGGAC 
GCGTACCTGCTGTATATACTGCTGACCGGGGCGCTGCAGTTCGGTTACTGTCTCCTCGTGGGGACCTTCCCCTTC 
AACTCTTXTCTCTCGGGCTTCATCTCTXGTGTGGGGAGTTTCATCCTAGCGGXTTGCCTGAGAATACAGATCAAC 
CCACAGAACAAAGCGGATTTCCAAGGCATCTCCCCAGAGCGAGCCTTTGCTGATTTTCTCTTTGCCAGCACCATC 
CTGCACCTTGTTGTCATGAACTTT6TTGGC TGAA TCATTCTCATTTACTTAATTGAGGAGTAGGAGACTAAAAGA 
ATGTTCACTCTTTGAATTTCCTGGATAAGAGTTCTGGAGATGGCAGCTTATTGGACACATGGATTTTCTTCAGAT 
TTGACACTTACTGCTAGCTCTGCTTTTTATGACAGGAGAAAAGCCCAGAGTTCACTGTGTGTCAGAACAACTTTC 
TAACAAACATTTATTAATCCAGCCTCTGCCTTTCATTAAATGTAACCXTTTGCTTTCCAAATTAAAGAACTCCAT 
G C C A C X C C X C AAAAAAAAAAAAAA 
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FIGURE 2 

MSASWSVISRFLEEYLSSTPQRLKLLDAYLLYILLTGALQFGYCLLVGTFPFNSFLSGFISCVGSFILAVCLRI 
QINPQNKADFQGISPERAFADFLFASTILHLWMNFVG 
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FIGURE 3 

CTCCGGCGCAGTGTTGGGACTGTCTGGGTATCGGAAAGCAAGCCTACGTTGCTCACTATTACGTATAATCCTTTT 
CTTTTCAAGATGCCTGAGGAAGTGCACCATGGAGAGGAGGAGGTGGAGACTTTTGCCTTTCAGGCAGAAATTGCC 
CAACTCATGTCCCTCATCATCAATACCTTCTATTCCAACAAGGAGATTTTCCTTCGGGAGTXGATCTCTAATGCT 
TCTGATGCCTTGGACAAGATTCGCTATGAGAGCCTGACAGACCCTTCGAAGTTGGACAGTGGTAAAGAGCTGAAA 
ATTGACATCATCCCCAACCCTCAGGAACGTACCCTGACTTTGGTAGACACAGGCATTGGCATGACCAAAGCTGAT 
CTCATAAATAATTTGGGAACCATTGCCAAGTCTGGTACTAAAGCATTCATGGAGGCTCTTCAGGCTGGTGCAGAC 
ATCTCCATGATTGGGCAGTTTGGTGTTGGCTTTTATTCTGCCTACTTGGTGGCAGAGAAAGTGGTTGTGATCACA 
AAGCACAACGATGATGAACAGTATGCTTGGGAGTCTTCTGCTGGAGGTTCCTTCACTGTGCGTGCTGACCATGGT 
GAGCCCATTGGCAGGGGTACCAAAGTGATCCTCCATCTTAAAGAAGATCAGACAGAGTACCTAGAAGAGAGGCGG 
GTCAAAGAAGTAGTGAAGAAGCATTCTCAGTTCATAGGCTATCCCATCACCCTTTATTTGGAGAAGGAACGAGAG 
AAGGAAATTAGTGATGATGAGGCAGAGGAAGAGAAAGGTGAGAAAGAAGAGGAAGATAAAGATGATGAAGAAAAA 
CCCAAGATCGAAGATGTGGGTTCAGATGAGGAGGATGACAGCGGTAAGGATAAGAAGAAGAAAACTAAGAAGATC 
AAAGAGAAATACAXTGATCAGGAAGAACTAAACAAGACCAAGCCTATTTGGACCAGAAACCCTGATGACATCACC 
CAAGAGGAGTATGGAGAATTCTACAAGAGCCTCACTAATGACTGGGAAGACCACTTGGCAGTCAAGCACTTTTCT 
GTAGAAGGTCAGTTGGAATTCAGGGCATTGCTATTTATTCCTCGTCGGGCTCCCTTTGACCTTTTTGAGAACAAG 
AAGAAAAAGAACAACATCAAACTCTATGTCCGCCGTGTGTTCATCATGGACAGCTGTGATGAGTTGATACCAGAG 
TATCTCAATTTTATCCGTGGTGTGGTTGACTCTGAGGATCTGCCCCTGAACATCTCCCGAGAAATGCTCCAGCAG 
AGCAAAATCTTGAAAGTCATTCGCAAAAACATTGTTAAGAAGTGCCTTGAGCTCTTCTCTGAGCTGGCAGAAGAC 
AAGGAGAATTACAAGAAATTCTATGAGGCATTCTCTAAAAATCTCAAGCTTGGAATCCACGAAGACTCCACTAAC 
CGCCGCCGCCTGTCTGAGCTGCTGCGCTATCATACCTCCCAGTCTGGAGATGAGATGACATCTCTGTCAGAGTAT 
GTTTCTCGCATGAAGGAGACACAGAAGTCCATCTATTACATCACXGGTGAGAGCAAAGAGCAGGTGGCCAACTCA 
GCTTTTGTGGAGCGAGTGCGGAAACGGGGCXXCGAGGTGGTATATATGACCGAGCCCATTGACGAGTACTGXGTG 
CAGCAGCXCAAGGAATTTGATGGGAAGAGCCXGGXCTCAGTTACCAAGGAGGGXCTGGAGCTGCCTGAGGAXGAG 
GAGGAGAAG2\AGAAGAXGGAAGAGAGCAAGGCA2^GXXTGAGAACCXCXGCAAGCXCAXGAAAGAAATCTXAGAX 
AAGAAGGTXGAGAAGGXGACAATCTCCAATAGACTXGXGXCXXCACCXXGCTGCATXGXGACCAGCACCTACGGC 
XGGACAGCCAAXAXGGAGCGGATCAXGAAAGCCCAGGCACTXCGGGACAACXCCACCAXGGGCTAXAXGATGGCC 
AAAAAGCACCTGGAGAXCAACCCXGACCACCCCAXTGTGGAGACGCTGCGGCAGAAGGCXGAGGCCGACAAGAAX 
GAXAAGGCAGTTAAGGACCXGGTGGTGCTGCTGTXXGAAACCGCCCXGCTATCXTCXGGCTTTTCCCTTGAGGAX 
CCCCAGACCCACXCCAACCGCAXCXAXCGCATGAXCAAGCTAGGXCXAGGTATTGATGAAGAXGAAGXGGCAGCA 
GAGGAACCCAATGCTGCAGTXCCTGAXGAGATCCCCCCTCXCGAGGGCGATGAGGAXGCGTCXCGCAXGGAAGAA 
GTCGAT TAG GTXAGGAGXXCATAGTTGGAAAACXXGXGCCCTTGTATAGXGXCCCCAXGGGCTCCCACXGCAGCC 
XCGAGTGCCCCTGXCCCACCXGGCXCCCCCTGCTGGTGXCTAGXGXXXTTTTCCCTCTCCXGXCCXXGXGXXGAA 
GGCAGTAAACTAAGGGTGTCAAGCCCCAXXCCCTCXCXACXCXXGACAGCAGGAXTGGATGTXGXGXAXXGXGGX 
XXATTTTAXXXXCTTCAXXXTGTTCTGAAAXXAAAGXATGCAAAATAAAGAAXAXGCCGTTTTAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAA 
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FIGURE 4 

MPEEVHHGEEEVETFAFQAEIAQLMSLIINTFYSNKEIFLRELISNASDALDKIRYESLTDPSKLDSGKELKIDI 
IPNPQERTLTLVDTGIGMTKADLINNLGTIAKSGTKAFMEALQAGADISMIGQFGVGFYSAYLVAEKWVITKHN 
DDEQYAWESSAGGSFTVRADHGEPIGRGTKVILHLKEDQTEYLEERRVKEWKKHSQFIGYPITLYLEKEREKEI 
SDDEAEEEKGEKEEEDKDDEEKPKIEDVGSDEEDDSGKDKKKKTKKIKEKYIDQEELNKTKPIWTRNPDDITQEE 
YGEFYKSLTNDWEDHLAVKHFSVEGQLEFRALLFIPRRAPFDLFENKKKKNNIKLYVRRVFIMDSCDELIPEYLN 
FIRGWDSEDLPLNISREMLQQSKILKVIRKNIVKKCLELFSELAEDKENYKKFYEAFSKNLKLGIHEDSTNRRR 
LSELLRYHTSQSGDEMTSLSEYVSRMKETQKSIYYITGESKEQVANSAFVERVRKRGFEWYMTEPIDEYCVQQL 
KEFDGKSLVSVTKEGLELPEDEEEKKKMEESKAKFENLCKLMKEILDKKVEKVTISNRLVSSPCCIVTSTYGWTA 
NMERIMKAQALRDNSTMGYMMAKKHLEINPDHPIVETLRQKAEADKNDKAVKDLVVLLFETALLSSGFSLEDPQT 
HSNRIYRMIKLGLGIDEDEVAAEEPNAAVPDEIPPLEGDEDASRMEEVD 
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FIGURE 5 

ATGCAGGCCCCACGGGAGCTCGCGGTGGGCATCGACCTGGGCACCACCTACTCGTGCGTGGGCGTGTTTCAGCAG 
GGCCGCGTGGAGATCCTGGCCAACGACCAGGGCAACCGCACCACGCCCAGCTACGTGGCCTTCACCGACACCGAG 
CGGCTGGTCGGGGACGCGGCCAAGAGCCAGGCGGCCCTGAACCCCCACAACACCGTGTTCGATGCCAAGCGGCTG 
ATCGGGCGCAAGTTCGCGGACACCACGGTGCAGTCGGACATGAAGCACTGGCCCTTCCGGGTGGTGAGCGAGGGC 
GGCAAGCCCAAGGTGCCGGTATCGTACCGCGGGGAGGACAAGACGTTCTACCCCGAGGAGATCTCGTCCATGGTG 
CTGAGCAAGATGAAGGAGACGGCCGAGGCGTACCTGGGCCAGCCCGTGAAGCACGCAGTGATCACCGTGCCCGCC 
TATTXCAATGACTCGCAGCGCCAGGCCACCAAGGACGCGGGGGCCATCGCGGGGCTCAACGTGTTGCGGATCATC 
AATGAGCCCACGGCAGCTGCCATCGCCTATGGGCTGGACCGGCGGGGCGCGGGAGAGCGCAACGTGCTCATTTTT 
GACCTGGGTGGGGGCACCTTCGATGTGTCGGTTCTCTCCATTGACGCTGGTGTCTTTGAGGTGAAAGCCACTGCT 
GGAGATACCCACCTGGGAGGAGAGGACTTCGACAACCGGCTCGTGAACCACTTCATGGAAGAATTCCGGCGGAAG 
CATGGGAAGGACCTGAGCGGGAACAAGCGTGCCCTCGGCAGGCTGCGCACAGCCTGTGAGCGCGCCAAGCGCACC 
CTGTCCTCCAGCACCCAGGCCACCCTGGAGATAGACTCCCTGTTCGAGGGCGTGGACTTCTACACGTCCATCACT 
CGTGCCCGCTTTGAGGAACTGTGCTCAGACCTCTTCCGCAGCACCCTGGAGCCGGTGGAGAAGGCCCTGCGGGAT 
GCCAAGCTGGACAAGGCCCAGATTCATGACGTCGTCCTGGTGGGGGGCTCCACTCGCATCCCCAAGGTGCAGAAG 
TTGCTGCAGGACTTCTTCAACGGCAAGGAGCTGAACAAGAGCATCAACCCTGATGAGGCTGTGGCCTATGGGGCT 
GCTGTGCAGGCGGCCGTGTTGATGGGGGACAAATGTGAGAAAGTGCAGGATCTCCTGCTGCTGGATGTGGCTCCC 
CTGTCTCTGGGGCTGGAGACAGCAGGTGGGGTGATGACCACGCTGATCCAGAGGAACGCCACTATCCCCACCAAG 
CAGACCCAGACTTTCACCACCTACTCGGACAACCAGCCTGGGGTCTTCAXCCAGGTGTATGAGGGTGAGAGGGCC 
ATGACCAAGGACAACAACCTGCTGGGGCGTTTTGAACTCAGTGGCATCCCTCCTGCCCCACGTGGAGTCCCCCAG 
ATAGAGGTGACCTTTGACATTGATGCTAATGGCATCCTGAGCGTGACAGCCACTGACAGGAGCACAGGTAAGGCT 
AACAAGATCACCATCACCAATGACAAGGGCCGGCTGAGCAAGGAGGAGGTGGAGAGGATGGTTCATGAAGCCGAG 
CAGTACAAGGCTGAGGATGAGGCCCAGAGGGACAGAGTGGCTGCCAAAAACTCGCTGGAGGCCCATGTCTTCCAT 
GTGAAAGGTTCTTTGCAAGAGGAAAGCCTTAGGGACAAGATTCCCGAAGAGGACAGGCGCAAAATGCAAGACAAG 
TGTCGGGAAGTCCTTGCCTGGCTGGAGCACAACCAGCTGGCAGAGAAGGAGGAGTATGAGCATCAGAAGAGGGAG 
CTGGAGCAAATCTGTCGCCCCATCTTCTCCAGGCTCTATGGGGGGCCTGGTGTCCCTGGGGGCAGCAGTTGTGGC 
ACTCAAGCCCGCCAGGGGGACCCCAGCACCGGCCCCATCATTGAGGAGGTTGATTGA 
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FIGURE 6 

MQAPRELAVGIDLGTTYSCVGVFQQGRVEILANDQGNRTTPSYVAFTDTERLVGDAAKSQAALNPHNTVFDAKRL 
IGRKFADTTVQSDMKHWPFRWSEGGKPKVPVSYRGEDKXFYPEEISSMVLSKMKETAEAYLGQPVKHAVITVPA 
YFNDSQRQATKDAGAIAGLNVLRIINEPTAAAIAYGLDRRGAGERNVLIFDLGGGTFDVSVLSIDAGVFEVKATA 
GDTHLGGEDFDNRLVNHFMEEFRRKHGKDLSGNKRALGRLRTACERAKRTLSSSTQATLEIDSLFEGVDFYTSIT 
RARFEELCSDLFRSTLEPVEKALRDAKLDKAQIHDWLVGGSTRIPKVQKLLQDFFNGKELNKSINPDEAVAYGA 
AVQAAVLMGDKCEKVQDLLLLDVAPLSLGLETAGGVMTTLIQRNATIPTKQTQTFTTYSDNQPGVFIQVYEGERA 
MXKDNNLLGRFELSGIPPAPRGVPQIEVTFDIDANGILSVTATDRSTGKANKITITNDKGRLSKEEVERMVHEAE 
QYKAEDEAQRDRVAAKNSLEAHVFHVKGSLQEESLRDKIPEEDRRKMQDKCREVLAWLEHNQLAEKEEYEHQKRE 
LEQICRP IFSRLYGGPGVPGGS SCGTQARQGDP STGP I lEEVD 
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FIGURE 7A 

XGCGACCGCCTCCCTGCGCCCCGCCCCTCCGGCTAGCTCGCTGGCTCCCGGCTCCTCCCGACGTCTCCTACCTCC 

TCACGGCTCTTCCCGGCGCTCTCCTGGCTCCCTTCTGCCCCAGCTCCGTCTCGGCGGCGGCGGGCAGTTGCAGTG 

GTGCAGAATGGCTGACCTCAGTCTTGCAGATGCATTAACAGAACCATCTCCAGACAXXGAGGGAGAGATAAAGCG 

GGACTTCATTGCCACACTAGAGGCAGAGGCCTTTGATGATGTTGTGGGAGAAACTGTTGGAAAAACAGACTATAT 

TCCTCTCCTGGATGTTGATGAGAAAACCGGGAACTCAGAGTCAftAGAAGAAACCGTGCTCAGAAACTAGCCAGAT 

TGAAGATACTCCATCTTCTAAACCAACACTCCTAGCCAATGGTGGTCATGGAGTAGAAGGGAGCGATACTACAGG 

GTCTCCAACTGAATTCCTTGAAGAGAAAATGGCCTACCAGGAATACCCAAATAGCCAGAACTGGCCAGAAGATAC 

CAACTTTTGTTTCCAACCTGAGCAAGTGGTCGATCCTATCCAGACTGATCCCTTTAAGATGTACCATGATGATGA 

CCTGGCAGATTTGGTCTTTCCCTCCAGTGCGACAGCTGATACTTCAATATTTGCAGGACAAAATGATCCCTTGAA 

AGACAGTTACGGTATGTCTCCCTGCAACACAGCTGTXGTACCTCAGGGGTGGTCTGTGGAAGCCTTAAACXCTCC 

ACACXCAGAGXCCXXXGXXXCCCCAGAGGCXGXXGCAGAACCXCCXCAGCCAACGGCAGXXCCCXTAGAGCXAGC 

CAAGGAGAXAGAAAXGGCATCAGAAGAGAGGCCACCAGCACAAGCATTGGAAATAATGATGGGACTGAAGACXAC 

XGACAXGGCACCAXCXAAAGAAACAGAGAXGGCCCXCGCCAAGGACAXGGCACTAGCXACAAAAACCGAGGXGGC 

AXXGGCXAAAGAXAXGGAAXCACCCACCAAAXXAGAXGXGACACXGGCCAAGGACAXGCAGCCAXCCAXGGAAXC 

AGATATGGCCCTAGTCAAGGACAXGGAACXACCCACAGAAAAAGAAGTGGCCCTGGXTAAGGAXGXCAGATGGCC 

CACAGAAACAGAXGXAXCXXCAGCCAAGAAXGTGGXACTGCCCACAGAAACAGAGGXAGCCCCAGCCAAGGAXGX 

GACACXGXXGAAAGAAACAGAGAGGGCAXCXCCXAXAAAAATGGACXXAGCCCCXXCCAAGGACATGGGACCACC 

CAAAGAAAACAAGAAAGAAACAGAGAGGGCATCTCCTATAAAAATGGACTTGGCXCCXXCCAAGGACAXGGGACC 

ACCCAAAGAAAACAAGAXAGXCCCAGCCAAGGAXXXGGXATXACXCXCAGAAATAGAGGXGGCACAGGCXAAXGA 

CAXXAXAXCAXCCACAGAAAXAXCCXCXGCXGAGAAGGTGGCXXXGXCCXCAGAAACAGAGGXAGCCCXGGCCAG 

GGACATGACACXGCCCCCGGAAACCAACGXGAXCTXGACCAAGGAXAAAGCACTACCTTXAGAAGCAGAGGTGGC 

CCCAGXCAAGGACATGGCTCAACTCCCAGAAACAGAAAXAGCCCCGGCCAAGGAXGXGGCXCCGXCCACAGXAAA 

AGAAGXGGGCTXGTTGAAGGACAXGXCXCCACXAXCAGAAACAGAAAXGGCXCXGGGCAAGGAXGXGACXCCACC 

XCCAGAAACAGAAGXAGXXCXCAXCAAGAACGXAXGXCXGCCXCCAGAAAXGGAGGXGGCCCTGACXGAGGAXCA 

GGTCCCAGCCCTCAAAACAGAAGCACCCCXGGCTAAGGAXGGGGXTCXGACCCXGGCCAACAAXGTGACXCCAGC 

CAAAGAXGXXCCACCACXCXCAGAAACAGAGGCAACACCAGXXCCAAXXAAAGACAXGGAAAXXGCACAAACACA 

AAAAGGAAXAAGXGAGGAXXCCCAXXXAGAAXCXCXGCAGGAXGXGGGGCAGXCAGCXGCACCXACXXXCAXGAX 

XXCACCAGAAACCAXCACAGGAACGGGGAAAAAGTGCAGCXXGCCGGCCGAGGAGGATXCTGTGXXAGAAAAACT 

AGGGGAAAGGAAACCAXGCAACAGXCAACCXXCXGAGCXXXCXXCAGAGACCXCAGGAAXAGCCAGGCCAGAAGA 

AGGAAGGCCXGXGGXGAGXGGGACAGGAAAXGACAXCACCACCCCACCGAACAAGGAGCXCCCACCAAGCCCAGA 

GAAGAAAACAAAGCCXTTGGCCACCACTCAACCTGCAAAGACXXCAACATCGAAAGCCAAAACACAGCCCACXTC 

TCXCCCXAAGCAGCCAGCXCCCACCACCAXXGGXGGGXTGAAXAAAAAACCCAXGAGCCXXGCXXCAGGCXXAGX 

GCCAGCXGCCCCACCCAAACGCCCXGCCGXCGCCTCXGCCAGGCCXXCCAXCXXACCXXCAAAAGACGXGAAGCC 

AAAGCCCAXXGCAGAXGCAAAGGCXCCXGAGAAGCGGGCCTCACCATCCAAGCCAGCXTCTGCCCCAGCCXCCAG 

AXCTGGGXCCAAGAGCACXCAGACXGXXGCAAAAACCACAACAGCTGCXGCTGTXGCCXCAACXGGCCCAAGCAG 

XAGGAGCCCCXCCACGCTCCXGCCCAAGAAGCCCACXGCCAXXAAGACTGAGGGAAAACCXGCAGAAGXCAAGAA 

GATGACTGCAAAGTCTGTACCAGCTGACTTGAGXCGCCCAAAGAGCACCXCCACCAGTXCCATGAAGAAAACCAC 

CACXCXCAGXGGGACAGCCCCCGCXGCAGGGGXGGXXCCCAGCCGAGXCAAGGCCACACCCAXGCCCXCCCGGCC 

CTCCACAACXCCXXTCATAGACAAGAAGCCCACCXCGGCCAAACCCAGCXCCACCACCCCCCGGCXCAGCCGCCX 

GGCCACCAAXACXXCXGCXCCTGAXCTGAAGAATGTCCGCXCCAAGGTTGGCTCCACGGAAAACATCAAGCAXCA 

GCCXGGAGGAGGCCGGGCCAAAGXAGAGAAAAAAACAGAGGCAGCXGCXACAACCCGAAAGCCXGAAXCXAAXGC 

AGXCACXAAAACAGCCGGCCCAAXXGCAAGXGCACAGAAACAACCXGCGGGGAAAGXCCAGAXAGXCXCCAAAAA 

AGXGAGCXACAGCCAXAXXCAGXCCAAGXGXGGXXCCAAGGACAAXAXTAAGCATGXCCCTGGAGGXGGXAAXGT 

XCAGATTCAGAACAAGAAAGXGGACATCXCXAAGGXCXCCXCCAAGXGTGGGXCTAAGGCXAACAXCAAGCACAA 

GCCXGGXGGAGGAGAXGXCAAGATXGAAAGXCAGAAGXXGAACXXCAAGGAGAAGGCCCAGGCCAAGGXGGGATC 

CCXCGAXAAXGXGGGCCACCXACCXGCAGGAGGXGCXGTGAAGACXGAGGGCGGXGGCAGCGAGGCXCCXCTGXG 

XCCGGGTCCCCCXGCXGGGGAGGAGCCGGCCATCXCTGAGGCAGCGCCXGAAGCTGGCGCCCCCACXXCAGCCAG 

XGGCCXCAAXGGCCACCCCACCCXGXCAGGGGGXGGTGACCAAAGGGAGGCCCAGACCXXGGACAGCCAGATCCA 
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FIGURE 7B 

GGAGACAAGCATCTAATGATGACATTCTGGTCTCGTCTTCCGTCTCCCCCGTGTTCCCCTCTTGTCTCCCCTGTT 
CCCCTCTCCCTTCCCTCCTCCCATGTCACTGCAGATTGAGACCTACAGGCTGACGTTCCGGGCAAATGCCAGGGC 
CCGCACCGACCACGGGGCCGACATTGTCTCCCGCCCCCCACACTTCCCTGGCGGCCCCAACTCGGGCXCCCGGGT 
CCTTGGCCCCCTTTCCCGGGCTGTCCACTAGACCAGTGAGCGCTTGGGCGCCGXGCTGGGCAGCCCGCTAGGCTC 
GCCTTCCCTCCTGCTTXGCGTGCCCGGGGCAGCAGCAGCCCTGCCCCACACCTCCTCTCACTCCCCAGCCTGGGC 
CCATCTCCCTGCTTTGGTCTTGCCCCATCACTGCGCCACTGCTCCGTGGAGGAGGTTGGGAGGGGGTTGGGGTGG 
TTGAGGCTAAGTTGGGATCTAGGAGAGGAGAACCAGATTCTATCCTCATCTTTTTTTGGXTCTTTGGTCCAAACC 
CAAAAGAAACXGACATGCCCXCCCXXCTCCCXGGATCXACCTGGAGGGAAGAGXGGAGGXGGAXXCCGAGXGGXG 
ACAGGACGCXGACCGTGGAGCXXAAGCCACTGCCXCXCCCTCXGGXCCCACAAAXGGGCGCCCCCCCCTCCCCAX 
GCAGGTGGTGTCGGGCCCXXCXXGCXGCCCXGCCCCAAGXXGGGGGTCAGXGCXGCCTGTCCCCAXGCXXAACAX 
ACCCGCCXAGCTGCXGXCACAXXTXXCTTGXXXXGXCCXXXXAXXXTTTTCTAAXAACCXAAAAACXGGCAAAAX 
AGXTCXGCAGGTTGAAGCCATGTCTACAXGAAAGTCCTCAGTAAGTGXXAGAGGGAACAGGGCGGAGATAXCCTX 
AXGCCACCCCCGCXGGAGGAXGXGGGCAGCXXAGGGCCCXGGAGGCGGXGCGGCAGGGAAGAGGGGXGCAGAGGC 
XGXGGCXGGXGAGCCGGXCAGGCACACAAGGGGCCCTTGGAGCGXGGACXGGXXGGXXXXGCCAXXXXGTXGXGX 
GTATGCXGCTXXXCTTXXCXAACCAAGAGGCTGGXTTTGGCATCXCTGXCCCATXCCCTGGGAXCXGGTGGXCAG 
CCCITAGGAXAAAAAGCCAGGGCTGGAGAAGAAGAAAGGGCCAGGAGAXGAAXTC 



wo 2004/041170 



PCT/US2003/034312 



9/2825 

FIGURE 8 

MADLSLADALTEPSPDIEGEIKRDFIATLEAEAFDDWGETVGKTDYIPLLDVDEKTGNSESKKKPCSETSQIED 

TPSSKPTLLANGGHGVEGSDTTGSPTEFLEEKMAYQEYPNSQNWPEDTNFCFQPEQWDPIQTDPFKMYHDDDLA 

DLVFPSSATADTSIFAGQNDPLKDSYGMSPCNTAWPQGWSVEALNSPHSESFVSPEAVAEPPQPTAVPLELAKE 

I EMAS EERP P AQALE IMMGLKTTDMA.P S KE TEiyLALAKDMALATKTE VALAKDMESP TKLD VTLAKDMQP SM 

ALVKDMELPTEKEVALVKDVRWPTETDVSSAKNWLPTETEVAPAKDVTLLKETERASPIKMDLAPSKDMGPPKE 

NKKETERASPIKMDLAPSKDMGPPKENKIVPAKDLVLLSEIEVAQANDIISSTEISSAEKVALSSETEVALARDM 

TLPPETNVILTKDKALPIiEAEVAPVKDMAQLPETEIAPAKDVAPSTVKEVGLLKDMSPLSETEMALGKDVTPPPE 

TEWLIKIWCLPPEMEVALTEDQVPALKTEAPLAKDGVLTLANWTPAKDVPPLSETEATPVPIKDI^ 

ISEDSHLESLQDVGQSAAPTFMISPETITGTGKKCSLPAEEDSVLEKLGERKPCNSQPSELSSETSGIARPEEGR 

PWSGTGNDITTPPNKELPPSPEKKTKPLATTQPAKTSTSKAKTQPTSLPKQPAPTTIGGLNKKPMSLASGLVPA 

APPKRPAVASARPSILPSKDVKPKPIADAKAPEKRASPSKPASAPASRSGSKSTQTVAKTTTAAAVASTGPSSRS 

PSTLLPKKPTAIKTEGKPAEVKKMTAKSVPADLSRPKSTSTSSMKKTTTLSGTAPAAGWPSRVKATPMPSRPST 

TPFIDKKPTSAKPSSTTPRLSRLATNTSAPDLKNVRSKVGSXENIKHQPGGGRAKVEKKTEAAATTRKPESNAVT 

KTAGPIASAQKQPAGKVQIVSKKVSYSHIQSKCGSKDNIKHVPGGGNVQIQNKKVDISKVSSKCGSKANIKHKPG 

GGDVKIESQKLNFKEKAQAKVGSLDNVGHLPAGGAVKTEGGGSEAPLCPGPPAGEEPAISEAAPEAGAPTSASGL 

NGHPTLSGGGDQREAQTLDSQIQETSI 
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FIGURE 9 

f 

CCGGCACGAGAGGAGTTGTGAGTTTCCAAGCCCCAGCTCACTCTGACCACTTCTCTGCCTGCCCAGCATCATffiA 
GGGCCTTGCAGCTGCCCTCCTTGTCCTCGTCTGCACCATGGCCCTCTGCTCCTGTGCACAAGTTGGTACCAACAA 
AGAGCTCTGCTGCCTCGTCTATACCTCCTGGCAGATTCCACAAAAGTTCATAGTTGACTATTCTGAAACCAGCCC 
CCAGTGCCCCAAGCCAGGTGTCATCCTCCTAACCAAGAGAGGCCGGCAGATCTGTGCTGACCCCAATAAGAAGTG 
GGTCCAGAAATACATCAGCGACCTGAAGCTGAATGCCTGAGGGGCCTGGAAGCTGCGAGGpCCCAGTGAACTTGG 
TGGGCCCAGGAGGGAACAGGAGCCTGAGCCAGGGCAATGGCCCTGCCACCCTGGAGGCCACCTCTTCTAAGAGTC 
CCATCTGCTATGCCCAGCCACATTAACTAACTTTAATCTXAGTTTATGCATCATATTTCATTTTGAAATTGATTT 
CTATTGTTGAGCTGCATTATGAAATTAGTATTTTCTCTGACATCTCATGACATTGTCTTTATCATCCTTTCCCCT 
TTCCCTTCAACTCTTCGTACATTCAATGCATGGATCAATCAGTGTGATTAGCTTTCTCAGCAGACATTGTGCCAT 
ATGTATCAAATGACAAATCTTTATTGAATGGTTTTGCTCAGCACCACCTTTTAATATATTGGCAGTACTTATTAX 
ATAAAAGGTAAACCAGCATTCTCACTGTGAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 10 



MKGLAAALLVLVCTMALCSCAQVGTNKELCCLVYTSWQIPQKFIVDYSETSPQCPKPGVILLTKRGRQICADPNK 
KWVQKYISDLKLNA 
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FIGURE 11 

ATTCCGCTCTTGCTTTCCGGCAGGTGATGGCGCCCCCCGCGGCCTAGAGGTCCAGCGCCCGCCGCGAGCAGCGGA 
CAGTCCTCCTGTTGTGTCCGACCGAGAGTCCTGGTGACTTTGAACAT03TGGTGCCGCTAGCCAAGCTGTCCTGC 
CTGGCATATCAGTGCTTTCATGCCTT7VAAAATTAAGAAAAATTATCTACCTCTATGTGCTATAAGATGGTCTTCA 
ACTTCTACTGTGCCTCGAATTACTACCCATTATACTATTTATCCCCGGGATAAGGACAAGAGATGGGAAGGAGTG 
AACATGGAAAGGTTTGCAGAAGAAGCAGATGTTGTAATAGTTGGTGCAGGCCCTGCAGGGCTCTCTGCAGCTGTT 
CGTCTAAAACAGTTGGCTGTGGCACATGAAAAGGACATCCGTGTGTGTCTAGTGGAGAAAGCTGCCCAGATAGGA 
GCTCATACTCTCTCAGGGGCTTGCCTTGATCCAGGTGCTTTTAAAGAACTCTTCCCAGACTGGAAAGAGAAGGGG 
GCTCCACTTAACACTCCTGTAACAGAAGACAGATTTGGAATTTTAACAGAG2VAATACAGAATTCCTGTGCCAATT 
CTTCCAGGGCTTCCAATGAATAATCATGGCAATTACATTGTACGCTTGGGACATTTAGTGAGCTGGATGGGCGAA 
CAAGCAGAAGCCCTTGGTGTTGAAGTATACCCTGGTTATGCAGCTGCTGAGGTCCTTTTTCATGATGATGGTAGT 
GTAAAAGGAATTGCCACTAACGATGTAGGGATACAAAAGGATGGTGCACCAAAGGCAACATTTGAGAGAGGACTG 
GAACTACATGCTAAAGTCACAATTTTTGCAGAAGGTTGCCATGGACATCTAGCCAAGCAACTATATAAGAAGTTT 
GATTTGAGAGCAAATTGTGAACCTCAAACCTACGGGATXGGACTGAAGGAGTTATGGGTTATTGATGAAAAGAAC 
TGGAAACCTGGGAGAGTAGATCACACTGTTGGTTGGCCCTTGGACAGACATACCTATGGAGGATCTTTCCTCTAT 
CATTTGAATGAAGGTGAACCCCTAGTAGCTCTTGGTCTTGTGGTTGGTCTAGACTATCAGAATCCATACCTGAGT 
CCATTTAGAGAGTTCCAAAGGTGGAAACACCATCCTAGCATTCGGCCAACCTTGGAAGGTGGAAAAAGGATTGCA 
TACGGAGCCAGAGCTCTCAATGAAGGTGGCTTTCAGTCTATACCAAAACTCACCTTTCCTGGTGGTTTACTAATT 
GGTTGTAGTCCTGGTTTTATGAATGTTCCCAAGATCAAAGGTACTCACACAGCAATGAAAAGTGGAATTTTAGCA 
GCAGAATCTATTTTTAATCAACTAACTAGTGAAAATCTCCAATCAAAGACAATAGGACTCCATGTAACTGAATAT 
GAGGACAATTTGAAGAACTCATGGGTATGGAAAGAGCTATATTCTGTTAGAAATATAAGACCGTCCTGCCACGGA 
GTACXGGGTGTATATGGAGGGATGATTTACACTGGAATCTTTTACTGGATATTGAGAGGAATGGAGCCGTGGACT 
CTGAAACATAAAGGTTCTGACTTTGAACGGCTCAAGCCAGCCAAGGATTGCACACCTATTGAGTATCCAAAACCC 
GATGGACAGATCAGTTTTGACCTCTTGTCATCTGTGGCTCTGAGTGGTACTAATCATGAACATGACCAGCCGGCA 
CACTTAACCTTAAGGGATGACAGTATACCTGTAAAXAGAAATCTGTCGATATATGATGGGCCCGAGCAGCGATTC 
TGTCCTGCAGGAGTTTATGAATTTGTACCTGTGGAACAAGGTGATGGATTTCGGTTACAGATAAATGCTCAGAAC 
TGTGTACATTGTAAAACAXGTGATATTAAAGATCCAAGTCAGAATATTAACTGGGTGGTACCTGAAGGTGGAGGA 
GGACCTGCTTACAATGGAATG5PAAACTGCAGCTAGCCAGTTTCTTTCAAGTATGGCAAGCTAACGTTAAAATGTT 
TAGAGATTAACAGATTTCAGAATGTCTTTCTGCATATTACTGAACAGAATAGTCACAAAATGATTATCAAATAAA 
AAT T T TAT AC T AT AAAAAAAAAAA 
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FIGURE 12 

MLVPLAKLSCLAYQCFHALKIKKNYLPLCAIRWSSTSTVPRITTHYTIYPRDKDKRWEGVNMERFAEEADWIVG 
AGPAGLSAAVRLKQLAVAHEKDIRVCLVEKAAQIGAHTLSGACLDPGAFKELFPDWKEKGAPLNTPVTEDRFGIL 
TEKYRIPVPILPGLPMNNHGNYIVRLGHLVSWMGEQAEALGVEVYPGYAAAEVLFHDDGSVKGIATNDVGIQKDG 
APKATFERGLELHAKVTIFAEGCHGHLAKQLYKKFDLRANCEPQTYGIGLKELWVIDEKNWKPGRVDHTVGWPLD 
RHTYGGSFLYHLNEGEPLVALGLWGLDYQNPYLSPFREFQRWKHHPSIRPTLEGGKRIAYGARALNEGGFQSIP 
KLTFPGGLLIGCSPGFMNVPKIKGTHTAMKSGILAAESIFNQLTSENLQSKTIGLHVTEYEDNLKNSWVWKELYS 
VRNIRPSCHGVLGVYGGMIYTGIFYWILRGMEPWTLKHKGSDFERLKPAKDCTPIEYPKPDGQISFDLLSSVALS 
GTNHEHDQPAHLTLRDDSIPVNRNLSIYDGPEQRFCPAGVYEFVPVEQGDGFRLQINAQNCVHCKTCDIKDPSQN 
INWWPEGGGGPAYNGM 
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FIGURE 13 

CTCAGCTTCTTTGCGTAACCAATACTGGAAGGCATTTAAAGGACCTCTGCCGCCTCAGACCTTGCAGTTAACTCC 
GCCCTGACCCACCCTTCCCG ATGC AGTCCCTGATGCAGGCTCCCCTCCTGATCGCCCTGGGCTTGCTTCTCGCGA 
CCCCTGCGCAAGCCCACCTGAAAAAGCCATCCCAGCTCAGTAGCTTTTCCTGGGATAACTGTGATGAAGGGAAGG 
ACCCTGCGGTGATCAGAAGCCTGACTCTGGAGCCTGACCCCATCGTCGTTCCTGGAAATGTGACCCTCAGTGTCG 
TGGGCAGCACCAGTGTCCCCCTGAGTTCTCCTCTGAAGGTGGATTTAGTTTTGGAGAAGGAGGTGGCTGGCCTCT 
GGATCAAGATCCCATGCACAGACTACATTGGCAGCTGTACCTTTGAACACTTCTGTGATGTGCTTGACATGTTAA 
TTCCTACTGGGGAGCCCTGCCCAGAGCCCCTGCGTACCTATGGGCTTCCTTGCCACTGTCCCTTCAAAGAAGGAA 
CCTACTCACTGCCCAAGAGCGAATTCGTTGTGCCTGACCTGGAGCTGCCCAGTTGGCTCACCACCGGGAACTACC 
GCATAGAGAGCGTCCTGAGCAGCAGTGGGAAGCGTCTGGGCTGCATCAAGATCGCTGCCTCTCTAAAGGGCATAT 
AGCATGGCATCTGCCACAGCAGAATGGAGCGGTGTGAGGAAGGTCCCTTTTCCTCTGTTTTGTGTTTGCCAAGGC 
CAAACTCCCACTCTCTGCCCCCCTTTAATCCCCTTTCTACAGTGAGTCCACTACCCTCACTGAAAATCATTTTGT 
ACCACTTACATTXTAGGCTGGGGCAAGCAGCCCTGACCTAAGGGAGAATGAGTTGGACAGTTCTTGATAGCCCAG 
GGCATCTGCTGGGCTGACCACGTTACTCATCCCCGTTAACATTCTCTCTAAAGAGCCTCGTTCATTTCCAAAGCA 
GTTAAGGAATGGGAACAGAGTGTTTTAGGACCTGAAGAATCTTTATGACTCTCTCTCTTTCTCTCTTTTTTTTTT 
TGTCACTAAGTTAAAAGCGAAGTGAGAGTATTAACGTTTTTGTTCTCCTCCGGCCCCCTGTTACAATGAAGGGGC 
AAAAGTATTTGCTCTTAGTCTATTCCTCCCTTAACTTCXGTGACTAATTTTTATTTCCTTTCTAGATTTGCCCAA 
TTAATACXAGGGTGCAGTGTATCCTGGAGAGGTAGGGTGTGTGGGGGAGGAATCCCTTGGGGGAGATATTAGGAG 
TGCTCTGTTGTTTACAAACTCAGGTACCCGCAGGGCCTAGCAAGAGACTTAAATGACTGATAAGAACCGTGAGAA 
ACATGTTGCTTCCAGGCTTGATTTCGATTTTTCGCTTTTTTTTTTTGGGACGGAGTCTCGCTTTGTCACCAGGCT 
GGAGTGCAGTGGTGCAATCTCACCTCACTGCAACCTCCGCCTCCTGGGTTCAAGCAATTCTCCTGCCTCAGCCTC 
CCAAGTAGCTTGGACTACAGGCCCTGCCACCACGCCCGGCTAATTTGTGTATTTTTAGTAGAGATGGGGTTTCAC 
CATGTTGGCCAGGATGGTCTCGATCTCTTGACCTCGTGATCCGTCCACCTTGGCCTTGCAAAGCGCTGGATTACA 
GGCATGAGCCACTACACCCAGCCGATTTTTCCTTTTTGATTAAAGATGCTATTACAATGTAAAXATTTCTTACAC 
AGAAAGTCACAGCACAXGXGCCCATTGAXACAAGGCTGCXGAGGCCXGGTCXCCAGXTGGAAAXAXAAXXAAGGG 
XGGCAGGGACXGGAGTCAGXXGGAGAGTGCATAGCCAGTCTGTGAAGACAACTGCCAGATACTGGCAAXACTCCA 
GCCXGGTGACAGAGXGAGACXCTGXCXCAAAAAAAAAGXXTCAAXGTXXACXCCTAGAGAAGCCAAAAAXCCAGA 
XXXGTAXAXGAAATCXXACCATTTTAAAAGAXXGGCAGCTAAXXAXTTTXTTAAAAAGCXGTGCAGXGXGATGXG 
TCCCAAACGGACTGGCXCAXGGGTGGCCACGXCACAACCTCXGAXCTCAGACCGTGCATGCCTTGTCCXCTTAAG 
ACAACXCCTGXGGCACCGTTTCXCCCTCCACAGGGCCAAAGCCAXAGXGXCCGGTCCCAAGGACAAGGCTCXTCC 
AGXGCTAGGAGAGGTATGAGCAGCCTCXCACCXGXGAGCXGXGGGGAXCACAAGGCXGCCXGCCTCAGTCXXGGA 
GTCCTGTXGGGTGAAXGAGGCAGATGGGAAAGAGCCTCACCAGCAGCTGCXXTTGGAGCAGGGGTCCAAGGAAGA 
GAGGGTGGCCXCGACATCAAACXGCCTGGAXXXXXCXACCACCCTGXXACAXCAXAACAACTXCTGAAACACACA 
CCAGCCCXGAGXXCTGGGCXCAXTXGAAGCCXGGAAXAGCAATAAAXCXTTTTAACXXGCAAAAAAAAAAAAAAA 
AAA 
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FIGURE 14 

MQSLMQAPLLIALGLLLATPAQAHLKKPSQLSSFSWDNCDEGKDPAVIRSLTLEPDPIWPGNVTLSWGSTSVP 
LSSPLKVDLVLEKEVAGLWIKIPCTDYIGSCTFEHFCDVLDMLIPTGEPCPEPLRTYGLPCHCPFKEGTYSLPKS 
EFWPDLELPSWLTTGNYRIESVLSSSGKRLGCIKIAASLKGI 
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FIGURE 15 

GAAAG ATGG CGTCCTCGGAGCAGGCAGAGCAGCC6AGCCAGCCAAGCTCTACTCCAGGAAGTGAAAATGTGCTGC 
CTCGAGAGCCGCTGATTGCCACGGCAGTGAAGTTTCTACAGAATTCCCGGGTCCGCCAGAGCCCACTTGCAACCA 
GGAGAGCATTTCTAAAGAAGAAAGGGCTGACAGATGAAGAGATTGATATGGCCTTCCAGCAGTCGGGCACTGCTG 
CCGATGAGCCXTCGTCCTTGGGCCCAGCCACACAGGTGGTTCCTGTCCAGCCCCCTCACCTCATATCTCAGCCAT 
ACAGTCCCGCAGGCTCCCGATGGCGAGATTACGGCGCCCTGGCCATCATCATGGCAGGCATTGCATTTGGCTTTC 
ACCAGCTCTACAAGAAATACCTGCTCCCCCTCATCCTGGGCGGCCGAGAGGACAGAAAGCAGCTGGAGAGGATGG 
AGGCCGGTCTCTCTGAGCTGAGTGGCAGCGTGGCCCAGACAGTGACTCAGTTACAGACGACCCTCGCCXCCGTCC 
AGGAGCTGCTGATTCAGCAGCAGCAGAAGATCCAGGAGCTTGCCCACGAGCTGGCCGCTGCCAAGGCCACCACAT 
CCACCAACTGGATCCTGGAGTCCCAGAATATCAACGAACTCAAGTCCGAAATTAACTCCTTGAAAGGGCTTCTTT 
TAAATCGGAGGCAGTTCCCTCCATCCCCATCAGCCCCGAAGATCCCCTCCTGGCAGATCCCAGTCAAGTCACCGT 
CACCCTCCAGCCCTGCGGCCGTGAACCACCACAGCAGCAGCGACATCTCACCTGTCAGCAACGAGTCCACGTCGT 
CCTCGCCTGGGAAGGAGGGCCACAGCCCCGAGGGCTCCACGGTCACCTACCACTTGCTGGGCCCCCAGGAGGAAG 
GCGAGGGGGTGGTGGACGTCAAGGGCCAGGTGCGGATGGAGGTGCAAGGCGAGGAGGAGAAGAGGGAGGACAAGG 
AGGACGAGGAGGATGAGGAGGATGATGATGTGAGCCATGTGGACGAGGAGGACTGCCTGGGGGTGCAGAGGGAGG 
ACCGCCGGGGCGGGGATGGGCAGATCAACGAGCAGGTGGAGAAGCTGCGGCGGCCCGAGGGCGCCAGCAACGAGA 
GTGAGCGGGAC TAGG GCTGCGCCTGCTGCCTCCAGCCCTGAGGATGGCATCTAGTGTGCCCGTGCGTGGCCATAC 
CCTGCCTCCCTCTCTGGCCCTGGGAGGGCAGCTTGGAGCCCAGGTAGGGGGCAGAGCTGTCCTCAGCTGCACTGC 
GGCCTGGTGGCAGTGTGGGGAGTCACACTTCTGTCCACCTGGCCTCCTCTCGCCTGGCCGCCAGCCCCAGCCCCA 
GCCCCAGCCCCAGGCCCAGCTGCCTTTGGCTTTGATCTCAAGTCAGGCTGAAGGCAGCGAAGCCTCGGGGCCCAA 
GCCCCTCCCCAGCCCCCTCTCCCGGACAGACGCCTTGCCCAGGGTGTGTTTGCTGAGTGTCTTGACTACCGTGAC 
ACCACGCATGGCCAGAGCTAGCGTCCCTACTGCCTCCCGACTCCTCAGTGGAGGAGGAGCTGCGGTCCCTCTGGT 
GTCTGCCATCCCCCTCCCTCCCTGGGCCCGGCCCTGGACCCGTCAGGTGCCTGTCCCCAGCCCCAACCCCACTCA 
TGCCCCGTCGTCCTCCCAGACAAATGAAACCACGCTGCGCTTCCGATGCCCCCGCTTGCCGTGTAATGGTTCAGC 
TAATCCCATGGCGAGATGGGGGCTCATTCCGGAGGAGGAGCCAGGCAGCAGGGCCTTCCTGACCAACAGCCAGXT 
TTGTCCTTCCCCCCAGGAAAAAAATGTTCATTTGTGTGATCATGTATAGACCTCAGAACGGAAGAXAGGACTGTA 
TATAATTGTAATAAATACCAGTTGCCACTATTT 
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FIGURE 16 

MASSEQAEQPSQPSSTPGSENVLPREPLIATAVKFLQNSRVRQSPLATRRAFLKKKGLTDEEIDMAFQQSGTAAD 
EPSSLGPATQWPVQPPHLISQPYSPAGSRWRDYGALAIIMAGIAFGFHQLYKKYLLPLILGGREDRKQLERMEA 
GLSELSGSVAQTVTQLQTTLASVQELLIQQQQKIQELAHELAAAKATTSTNWILESQNINELKSEINSLKGLLLN 
RRQFPPSPSAPKIPSWQIPVKSPSPSSPAAVNHHSSSDISPVSNESTSSSPGKEGHSPEGSTVTYHLLGPQEEGE 
GWDVKGQVRMEVQGEEEKREDKEDEEDEEDDDVSHVDEEDCLGVQREDRRGGDGQINEQVEKLRRPEGASNESE 
RD 
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FIGURE 17A 

GCTGTTTTGACAACATGGCGGCGCCCATGGTCCGTGGCCCGGCAGTGCTCGCCTAAAGGTGGAGAACGAGGAGTA 

GAGGAGGCCGCAGCCAGAGCCTGTGAGCAGATCCAGACCTACAGATAAAAAACATTATTTAATCTATCTGGGATT 

TACTCCGGCTTATGATTTGAGGGCCTTCTCACCTTCTGAAGAATGGCTTCTGTTTGGCAGAGATTGGGTTTTTAT 

GCCTCTCTTCTGAAAAGACAGCTAAATGGTGGGCCAGATGTCATCAAGTGGGAAAGGAGAGTAATTCCCGGATGT 

ACCAGAAGCATCTACAGTGCCACGGGAAAGTGGACAAAAGAGTATACATTGCAGACAAGAAAGGATGTTGAGAAA 

TGGTGGCATCAACGAATAAAAGAACAGGCCTCCAAAATTTCAGAAGCTGATAAATCG2U^GCCAAAATTTTACGTG 

CTTTCCATGTTCCCTTATCCTTCTGGTAAGCTGCACATGGGCCATGTGCGTGTCTACACCATCAGCGACACCATA 

GCACGGTTCCAGAAGATGAGAGGGATGCAGGTCATCAACCCCATGGGATGGGATGCTTTTGGATTGCCTGCTGAA 

AATGCCGCAGTCGAGAGGAATCTACATCCACAAAGTTGGACACAAAGTAATATTAAACACATGAGGAAACAGCTT 

GATCGTCXGGGCCTGTGTTTCAGCTGGGATAGGGAAATAACTACGTGTTTGCCAGATTACTACAAGTGGACTCAG 

TATCTCTTTATTAAACTGTATGAGGCTGGGCTGGCCTATCAAAAGGAGGCCCTGGTTAACTGGGACCCAGTGGAT 

CAAACAGTGCTTGCCAATGAGCAGGTGGATGAACATGGCTGTTCATGGCGTTCTGGAGCAAAGGTGGAACAGAAG 

TACCTCAGACAATGGTTTATTAAGACAACCGCTTATGCAAAGGCCATGCAGGACGCGTTGGCAGACCTTCCAGAA 

TGGTATGGAATAAAAGGCATGCAAGCCCACTGGATTGGGGACTGTGTGGGCTGCCACCTGGACTTCACATTAAAG 

GTTCATGGGCAAGCCACGGGCGAAAAGCTGACTGCCTATACGGCCACCCCTGAAGCCATTTATGGCACCTCCCAC 

GTGGCCATCTCGCCCAGCCACAGACTCCTACATGGGCACAGCTCTCTGAAGGAAGCCTTGAGGATGGCCCTTGTC 

CCTGGCAAAGATTGCCTCACGCCTGTAATGGCTGTGAACATGCTTACCCAGCAGGAGGTCCCTGTCGTTATTTXG 

GCCAAAGCTGACTTGGAAGGCTCTCTGGATTCAAAAATAGGAATTCCCAGTACTAGCTCAGAGGACACCATCTTA 

GCCCAAACCCTGGGCCTGGCCTACTCTGAAGTCATTGAAACTTTGCCAGATGGCACAGAGAGACTGAGCAGCTCT 

GCTGAGTTCACAGGTATGACCCGGCAGGATGCTTTTCTAGCCCTGACTCAGAAAGCCCGGGGGAAGAGAGTGGGT 

GGAGACGTGACAAGTGATAAACTGAAAGACTGGCTGATTTCACGGCAGCGGTACTGGGGCACACCAATCCCCATT 

GTCCACTGCCCAGTCTGTGGCCCCACACCTGTGCCCCTGGAGGACTTGCCTGTGACCCTGCCCAACATCGCGTCT 

TTCACTGGCAAGGGAGGCCCCCCACTGGCCATGGCTTCAGAGXGGGTGAACXGCTCCTGCCCAAGGTGCAAGGGA 

GCAGCCAAGAGAGAGACAGACACGAXGGAXACCTTTGXXGAXXCXGCTXGGXACXACTXCAGAXACACTGACCCT 

CAXAATCCACACAGCCCTXTTAACACAGCAGTGGCCGATTACXGGATGCCTGTGGATTTGTACAXXGGAGGGAAA 

GAACAXGCCGTCAXGCACTXGXXCXATGCAAGATXCTTTAGTCATTTXXGCCATGAXCAAAAAAXGGTTAAACAX 

AGGGAGCCXXXTCAXAAGCTGCXGGCCCAAGGCCXXAXCAAGGGGCAGACAXXCCGCCXACCAXCTGGACAGXAT 

CXACAGAGAGAGGAAGXGGATCTCACAGGTTCCGTTCCTGTXCATGCAAAAACGAAAGAGAAGTTAGAGGTGACG 

XGGGAGAAGATGAGXAAGTCCAAACACAACGGGGXGGACCCAGAGGAAGTTGTGGAGCAGTATGGGATCGACACG 

ATXCGGCXCXACAXCCXXXXXGCXGCCCCTCCXGAGAAGGAXATCXXGXGGGATGXGAAAACTGAXGCXCXCCCX 

GGGGTGCTGAGATGGCAACAACGACTGTGGACCTTGACAACTCGGTTTATTGAGGCCAGGGCTTCTGGGAAGXCT 

CCCCAGCCTCAGCXGCTGAGXAACAAGGAGAAAGCTGAGGCCAGGAAGCXCTGGGAGXACAAGAACXCCGTCATC 

TCTCAGGTGACCACCCATTXCACAGAGGACXXCTCACXGAAXXCXGCAATTXCXCAGCXGAXGGGACXCAGCAAX 

GCCCTCTCGCAAGCCXCXCAGAGCGXCAXXCTCCACAGCCCCGAGTTTGAGGATGCTTTGTGXGCCCTGATGGTA 

ATGGCXGCTCCACXGGCCCCXCATGXAACCTCAGAGAXCXGGGCAGGCCTGGCGCXGGXGCCGAGGAAGCTCXGT 

GCCCACTACACXTGGGAXGCCAGXGXGCXGCXCCAGGCAXGGCCXGCXGXGGACCCGGAGXXCCXGCAGCAGCCX 

GAGGTTGTCCAGATGGCAGTTCTGATCAACAATAAAGCTTGXGGCAAAATTCCTGTGCCCCAACAAGTTGCCCGG 

GACCAGGACAAAGXCCACGAAXXTGXTCTXCAAAGCGAGCXGGGXGXCAGGCTTTTGCAAGGACGAAGCAXCAAG 

AAGXCCXXCCXXXCCCCGAGAACXGCCCTCATCAACTTCCXGGXGCAAGATgPGACAGCCAGGAGGCXGCAGCXAC 

CACGAGGGCCXCXGAGGAACCXCCXXCCAGGCCTGGGATGAGGGGGCGATGTCTGCTGGCCCAGGGGAAGGGAAA 

AGACAAAXGXCTTGACTGXXGACCXCGGXCCXGXGGCAGACTGCAGXCAACAGTGXGCCXCXGXAGXGXGGCCXG 

GXGCXGGGGTGAAGGTGAGCTGGGCAAAGGAGAAAXAXGAGCXACXGAGGAGGGGGXXGGACAXCCTGCCCCXCA 

CCCCCCACCCACACXGCAGGTAGAGGAGGCCATCTGAXCCCATGGGAAGCCAXCAGAGACACXGCTGGTGGGAGC 

AGGAAGGAGCAGXGCCCCTCGAGCAGCCAGGAAGCCTGCGGAXCXGGGAAATGGCXCTGCCXTAGGCACTXCXCG 

GGAATXXGAGGCCAGCCXGAGGAACXGCAGGACTCAGGTGCAATGTGCCAGCCACXXGGAACXGCXAACXGAGCC 

XCCAGAXGGXAGXGAAXGGXCXCXXXGCCXXCAGGCXGGAXGAGGAAGXCAXTXAGGAAAXGXXCAAAXAACCAA 

XATGXGGAAATGGACACAGGGAXCTXCTGAAGXTGCTXTGAATCAAAAGGCAGGCAGTGCXGGTTCCTCXGCCTG 

XGXCCCCACCACXCCCCAGCXCTGXCAXGCAGGCCXGXCCXCCCCAACCCCAGCXGGAXGTGCCXCCCAGGCCXG 
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FIGURE 17B 

CTGTGGTTCTGACACACAGGATCCCAGGCAAGGCACCACTTCCTCACATGAATGAGGAGCAGCAAGTCATAACCA 
CTCCCTTGGGTATACAATTTGCTGTGTAGTGAAGTGGAACCAGGCTCAGGCTGCTGGTCCCAACCTCAGAGCCCC 
ACCGCAGCCCAGTAGGGATGCAGCACGCCCCAGAGGGCTCATGTGGGCCCCAGATGGCAATGCCACCATTGTTGA 
TGTGACTCCAGAGCCAGTTATTAGGAAGAGCAAGCTCACCACAGAGGAGTGGAACTGAGGCCCCCCAGATGTTGC 
CTCCGGTGTCCAAGCCACAGCGGTCTGGCTGTTGGGAAGATGGCCAGGAATGGACTCATACCATTGGCACATTAG 
GCTAATCCTGGTTTTATGTGAAGTCAGCAATTAAGTGTTCCCACTAGAACTGACCXAAGCCACTGATTAATATTT 
AATGAGGGAAGGTAGGGGAGAATCTAGCCATTTTATAATGCCAGAAATCTATATATGTTATCTGATGCCATTTTT 
CTGAAGTAGCCTCACATGTGGTCCCCCTGCAGTTCAGCAGTTAACAGATGACTTTTTTAGTGTAATAAAATGTTT 
ATCATCTATG 
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FIGURE 18 

MASVWQRLGFYASLLKRQLNGGPDVIKWERRVIPGCTRSIYSATGKWTKEYTLQTRKDVEKWWHQRIKEQASKIS 

EADKSKPKFYVLSMFPYPSGKLHMGHVRVYTISDTIARFQKMRGMQVINPMGWDAFGLPAENAAVERNLHPQSWT 

QSNIKHMRKQLDRLGLCFSWDREITTCLPDYYKWTQYLFIKLYEAGLAYQKEALVNWDPVDQTVLANEQVDEHGC 

SWRSGAKVEQKYLRQWFIKTTAYAKAMQDALADLPEWYGIKGMQAHWIGDCVGCHLDFTLKVHGQATGEKLTAYT 

ATPEAIYGTSHVAISPSHRLLHGHSSLKEALRMALVPGKDCLTPVMAVNMLTQQEVPWILAKADLEGSLDSKIG 

IPSTSSEDTILAQTLGLAYSEVIETLPDGTERLSSSAEFTGMTRQDAFLALTQKARGKRVGGDVTSDKLKDWLIS 

RQRYWGTPIPIVHCPVCGPTPVPLEDLPVTLPNIASFTGKGGPPLAMASEWVNCSCPRCKGAAKRETDTMDTFVD 

SAWYYFRYTDPHNPHSPFNTAVADYWMPVDLYIGGKEHAVMHLFYARFFSHFCHDQKMV^HREPFHKLLAQG 

GQTFRLPSGQYLQREEVDLTGSVPVHAKTKEKLEVTWEKMSKSKHNGVDPEEWEQYGIDTIRLYILFAAPPEKD 

ILWDVKTDALPGVLRWQQRLWTLTTRFIEARASGKSPQPQLLSNKEKAEARKLWEYKNSVISQVTTHFTEDFSLN 

SAISQLMGLSNALSQASQSVILHSPEFEDALCALMVMAAPLAPHVTSEIWAGLALVPRKLCAHYTWDASVLLQAW 

PAVDPEFLQQPEWQMAVLINNKACGKIPVPQQVARDQDKVHEFVLQSELGVRLLQGRSIKKSFLSPRTALINFL 

VQD 
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FIGURE 19 

TATATTGGCAGTTATTGAGGGTAAAGCAATATATTGTAACAGAATGTATAAATATTTTTGATAAAACAGTCTATA 
TTTTATTAAAAAATGAATTATAACCCATTTTCAGTTTTGCCTGCATCATAAGAGTGAGCACTCCATTGCTTTCTT 
TCCTGGCCACACTGCTACAATCCAGCACTAACTATCCATGTCCAGGGTAAGGAICGAGATCGAGAAGCCCACACT 
GCCAGTGAAAAAGCTACGTCTTTACTGCATAAATTAGAGGAAGCAATTTCGGAACAACGGAACCXTCAAACTATA 
AATACTGAATTATCGAACACTTGCCAGGCACTTCAGCAGAAGACAAGGAAACTGAAGAAGCTTXTTAGATGAGGA 
ATTTCCTCACTATGATTCCCTGTCCTGCGCAGATGCAATTCAACAACCTCTTCAAGAAAAATTGAAGCAGTGTTG 
CCACAAACTATATGGTGGTCAAGAAGCAAGAATACATCAGACACCCCTGACCTTGAAACATACGTGCTGGTACAC 
ACCTCTGCTGGATGCCTTATCTCTGGATAGTTTTACAGCAGTTCCAACCCTGGAATCAACACCTTTCTCAGGTGT 
AGCCAACCAAATCCACACTCTGTGTGAAAGGCCCACATATGGAGAAGTAAAGGATGGTGCTTTGGATGTAAAAAG 
ACAACACAAGTGCCCAGGCCCCACAAGTGGCCCCAGCCCAGGAACGAATCTCTCAGGCTGCATCAGGATGAATGA 
TGACCCAAGTATGGAAGAGAATGGTGTTGAACGCGTGTGTCCTGAGAGCCTGCTGCAGTCCAGGGGATATTCCTC 
ACTACCATTACCCAGACACACTTCATCGACAGACGGTACTATAACTTCAAGTGATCCTGGATTAGAAATTCTGAA 
TATGGCTTCTTGTGACCTTGACAGAAACTCGCTCTGTAAGAAAGAGGAGGATACAAGATCAGCTTCTCCCACGAT 
AGAGGCCCAAGGCACAAGTCCAGCTCATGATAATATTGCATTCCAAGACTCTACGAGTAAGGATAAAACCATATT 
AAATCTGGAAGCCAAAGAGGAACCAGAAACAATAGAAGAACATAAAAAAGAACATGCTTCAGGAGACTCTGTGGT 
TTCCCCTCTTCCTGTAACCACTGTGAAATCGGTTAACGTTAGACAAAGTGAGAACACTTCTGCTAATGAGAAGGA 
GGTGGAGGCAGAATTTCTCAGATTATCTTTGGGATTTAAGTGTGACTGGTTTACCTTGGAGAAGAGAGTGAAGCT 
TGAAGAGAGGTCCCGTGACTGGGCAGAAGAAAATTTGAAGAAAGAAATCACTAACTCTTTAAAACTATTAGAGTC 
TTTAACACCTCTGTGTGAAGATGACAACCAGGCACAGGAAATCATTAAGAAGCTGGAGAAGAGTATAAAGTTTCT 
TAGCCAGTGTGCAGCACGAGTGGCCAGTAGGGCTGAGATGTTGGGAGCCATCAATCAGGAAAGCCGGGTTAGTAA 
AGCAGTTGAAGTGATGATTCAGCACGTAGAAAACTTGAAGAGGATGTATGCCAAAGAGCACGCTGAATTAGAAGA 
ACTGAAACAGGTTCTTCTGCAGAATGAAAGGTCTTTCAATCCTCTTGAAGATGATGATGACTGCCAAATTAAAAA 
ACGTTCAGCTTCTCTAAACTCCAAGCCATCTTCTCTACGAAGAGTGACTATTGCCTCTTTACCCAGAAATATTGG 
AAATGCAGGAATGGTGGCXGGGATGGAAAATAATGATCGATTCAGTAGAAGGTCAAGCAGTTGGCGTATTTTGGG 
GTCAAAGCAGAGTGAACACCGTCCCTCATTACCTCGATTTATTAGCACCTATTCCTGGGCAGATGCTGAAGAAGA 
AAAAXGT G AAC XAAAAAC TAAAGAT GAG T C AG AGCC ATC TGGAGAAGAAACAGT AGAAAGGAC AAGGAAGCC AAG 
TCTTTCTGAAAAGAAAAATAATCCATCAAAGTGGGAXGTCTCTTCAGTTTATGACACAATAGCTTCCTGGGCAAC 
AAATCTCAAGTCCTCCATCAGAAAGGCXAATAAGGCCCTCTGGCTCXCTATTGCATXCATTGTACXGTTTGCAGC 
XXTGATGAGCTXCCXCACAGGCCAATTAXTCCAGAAGXCTGXGGATGCCGCXCCCACACAGCAAGAGGACXCATG 
GACGXCXCXAGAACATAXCTXGXGGCCATXXACCAGACTCCGACACAATGGGCCACCACCAGXGTG^AGCAGGA 
CATCCTAATATATGGATCTTGATTTTTAAGTTTCAGTATCTGAACXTCGTAAAXXAGXAACTXXTAGCTGGGAAA 
GTAXAGCATGAAACCAGAGGXXCXCAGAAXGACCGTAAGATAGCXTACAXTXCCTCTXXTXGCCTTTATCTCCCC 
AACXAAAATACAAXGGG 
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FIGURE 20 

MESTPFSGVANQIHTLCERPTYGEVKDGALDVKRQHKCPGPTSGPSPGTNLSGCIRMNDDPSMEENGVERVCPES 
LLQSRGYSSLPLPRHTSSTDGTITSSDPGLEILNMASCDLDRNSLCKKEEDTRSASPTIEAQGTSPAHDNIAFQD 
STSKDKTILNLEAKEEPETIEEHKKEHASGDSWSPLPVTTVKSVNVRQSENTSANEKEVEAEFLRLSLGFKCDW 
FTLEKRVKLEERSRDWAEENLKKEITNSLKLLESLTPLCEDDNQAQEIIKKLEKSIKFLSQCAARVASRAEMLGA 
INQESRVSKAVEVMIQHVENLKRMYAKEHAELEELKQVLLQNERSFNPLEDDDDCQIKKRSASLNSKPSSLRRVT 
lASLPRNIGNAGMVAGMENNDRFSRRSSSWRILGSKQSEHRPSLPRFISTYSWADAEEEKCELKTKDDSEPSGEE 
TVERTRKPSLSEKKNNPSKWDVSSVYDXIASWATNLKSSIRKANKALWLSIAFIVLFAALMSFLXGQLFQKSVDA 
APTQQEDSWTSLEHILWPFTRLRHNGPPPV 
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FIGURE 21 

ATGGTGCTGTGCCCGGTGATTGGGAAGCTGCTGCACAA6CGCGT6GTGCTGGCCAGCGCCTCCCCACGCCGTCAG 
GAGATCCTCAGCAACGCGGGTCTCAGGTTTGAGGTGGTCCCCTCCAAGTTTAAAGAGAAGCTGGACAAAGCCTCC 
TTCGCTACTCCGTATGGGTACGCCATGGAGACCGCCAAGCAGAAGGCCCTGGAGGTGGCCAACCGGCTATACCAG 
AAAGACCTGCGGGCCCCCGACGTGGTCATTGGAGCGGACACGATCGTGACGGTCGGGGGGCTGATTCTGGAGAAG 
CCGGTGGACAAGCAGGACGCCTACAGGATGCTGTCCCGGTTGAGTGGGAGAGAACACAGCGTGTTCACAGGTGTC 
GCGATCGTCCACTGCTCCAGCAAAGACCATCAGCTGGACACCAGGGTCTCGGAATTCTACGAGGAAACGAAGGTG 
AAGTTCTCGGAGCTGTCCGAGGAGCTGCTCTGGGAATACGTCCACAGCGGGGAGCCCATGGACAAAGCTGGCGGC 
TACGGTATCCAGGCCCTGGGCGGCATGCTGGTGGAGTCCGTACACGGGGACTTTCTGAACGTGGTGGGATTCCCG 
CTGAACCACTTCTGCAAGCAGCTGGTGAAGCTCTACTACCCGCCCCGTCCGGAGGACCTGCGGCGGAGTGTCAAG 
CAOGACTCCATCCCGGCCGCGGACACCTTCGAAGACCTCAGTGACGTGGAGGGGGGCGGCTCGGAGCCCACTCAG 
AGGGACGCGGGCAGCCGCGATGAGAAGGCCGAGGCGGGAGAGGCGGGACAGGCCACGGCAGAGGCTGAGTGTCAC 
AGGACTCGGGAGACCCTGCCTCCGTTCCCGACACGCCTCCTGGAGCTGATTGAGGGCTTTATGCTATCCAAGGGC 
CTGCTCACCGCTTGCAAACTGAAGGTGTTCGATTTATTAAAAGATGAAGCACCCCAGAAGGCTGCGGATATTGCC 
AGCAAAGTGGACGCCTCTGCGTGXGGAATGGAGAGGCTTCTGGACATCTGTGCTGCCATGGGGCTCCTGGAG2^G 
ACAGAGCAAGGTTACAGTAACACAGAGACAGCGAACGTCTACCTGGCATCGGATGGCGAATACTCTCTGCACGGC 
TTCATCATGCACAATAATGACCTCACATGGAACCTCTTTACATACCTGGAGTTTGCCATCCGAGAGGGAACAAAC 
CAGCACCACAGGGCGTTGGGGAAGAAGGCGGAAGATCTGTTCCAGGATGCGTACTACCAGAGCCCGGAGACGCGG 
CTGAGGTTCATGCGGGCCATGCACGGCATGACGAAGCTGACTGCGTGCCAGGTGGCCACGGCCTTCAATCTGTCC 
CGCTTCTCCTCCGCCTGCGACGTGGGAGGCTGCACGGGTGCACTGGCCCGAGAGCTGGCCCGTGAGTACCCTCGT 
ATGCAGGTGACTGTGTTTGACCTCCCAGACATTATCGAGCTGGCCGCCCACTTCCAACCCCCCGGACCGCAGGCA 
GTGCAGATCCACTTCGCAGCAGGTGACTTTTTCAGGGACCCCCTCCCCAGCGCTGAGCTGTACGTCCTGTGCCGG 
ATCCTGCATGACTGGCCAGACGACAAAGTCCACAAGTTACTCAGCAAGGTCGCCGAGAGCTGCAAGCCAGGGGCC 
GGCCTGCTGCTGGTGGAGACGCTCCTGGATGAGGAGAAGAGGGTGGCGCAGCGCGCCCTGATGCAGTCACTGAAC 
ATGCTGGTGCAGACTGAAGGCAAGGAGCGGAGCCTGGGCGAGTATCAGTGCTTGCTGGAGCTGCACGGCTTCCAC 
CAGGTGCAGGTGGTGCACTTGGGGGGTGTCCTGGATGCCATCTTGCCACCAAAGTGGCCCCCTGAAGCCCAGGCA 

GCATGTTCATT ATAG 
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FIGURE 22 

MVLCPVIGKLLHKRWLASASPRRQEILSNAGLRFEWPSKFKEKLDKASFATPYGYAMETAKQKALEVANRLYQ 

KDLRAPDWIGADTIVTVGGLILEKPVDKQDAYRMLSRLSGREHSVFTGVAIVHCSSKDHQLDTRVSEFYEETKV 

KFSELSEELLWEYVHSGEPMDKAGGYGIQALGGMLVESVHGDFLNWGFPLNHFCKQLVKLYYPPRPEDLRRSVK 

HDSIPAADTFEDLSDVEGGGSEPTQRDAGSRDEKAEAGEAGQATAEAECHRTRETLPPFPTRLLELIEGFMLSKG 

LLTACKLKVFDLLKDEAPQKAADIASKVDASACGMERLLDICAAMGLLEKTEQGYSNTETAWYLASDGEY 

FIMHNNDLTWNLFTYLEFAIREGTNQHHRALGKKAEDLFQDAYYQSPETRLRFMRAMHGMTKLTACQVATAFNLS 

RFSSACDVGGCTGALARELAREYPRMQVTVFDLPDIIELAAHFQPPGPQAVQIHFAAGDFFRDPLPSAELYVLCR 

ILHDWPDDKVHKLLSKVAESCKPGAGLLLVETLLDEEKRVAQRALMQSLNMLVQTEGKERSLGEYQCLLELHGFH 

QVQWHLGGVLDAILPPKWPPEAQAACSL 
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FIGURE 23 

CGCCTCTCCCAAAGTCTAGCCGGGCAGGGGAACGCGGTGCATTCCTGACCGGCACCTGGCGAGGCTCATGCGTCC 

CGTGAGGGCGGTTCCTCGAGCCTGGGGGCGCTCAGATTGCTTTGGAGACGCTGAGAGAACCTTTGCGAGAGCGCC 

GGTTGACGTGCGGAGTGCGGGGCTCCGGGGGACTGAGCAGCACGAGACCCCATCCTCCCCTCCGGGTTTTCACAC 

TGGGCGAAGGGAGGACTCCTGAGCTCTGCCTCTTCCAGTAACATTGAGGATTACTGTGTXTTGTGAGAGCTCGCT 

AGGCGCCCTAAGCAACAGAGTTCTGAGAAATCGAGAAACMGATAAGGAATTGGCTGACTATTTTTATCCTTTTT 

CCCCTGAAGCTCGTAGAGAAATGTGAGTCAAGCGTCAGCCTCACTGTTCCTCCTGTCGTAAAGCTGGAGAACGGC 

AGCTCGACCAACGTCAGCCTCACCCTGCGGCCACCATTAAATGCAACCCTGGTGATCACTTTTGAAATCACATTT 

CGTTCCAAAAATATTACTATCCTTGAGCTCCCCGATGAAGTTGTGGTGCCTCCTGGAGTGACAAACTCCTCTTTT 

CAAGTGACATCTCAAAATGTTGGACAACTTACTGTTTATCTACATGGAAATCACTCCAATCAGACCGGCCCGAGG 

ATACGCTTTCTTGTGATCCGCAGCAGCGCCATTAGCATCATAAACCAGGTGATTGGCTGGATCTACTTTGTGGCC 

TGGTCCATCTCCTTCTACCCTCAGGTGATCATGAATTGGAGGCGGAAAAGTGTCATTGGTCTGAGCTTCGACTTC 

GTGGCTCTGAACCTGACAGGCTTCGTGGCCTACAGTGTATTCAACATCGGCCTCCTCTGGGTGCCCTACATCAAG 

GAGCAGTTTCTCCTCAAATACCCCAACGGAGTGAACCCCGTGAACAGCAACGACGTCTTCTTCAGCCTGCACGCG 

GTTGTCCTCACGCTGATCATCATCGTGCAGTGCTGCCTGTATGAGCGCGGTGGCCAGCGCGTGTCCTGGCCTGCC 

ATCGGCTTCCTGGTGCTCGCGTGGCTCTTCGCATTTGTCACCATGATCGTGGCTGCAGTGGGAGTGATCACGTGG 

CTGCAGTTTCTCTTCTGCTTCTCCTACATCAAGCTCGCAGTCACGCTGGTCAAGTATTTTCCACAGGCCTACATG 

AACTTTTACTACAAAAGCACTGAGGGCTGGAGCATTGGCAACGTGCTCCTGGACTTCACCGGGGGCAGCTTCAGC 

CTCCTGCAGATGTTCCTCCAGTCCTACAACAACGACCAGTGGACGCTGATCTTCGGAGACCCAACCAAGTTTGGA 

CTCGGGGTCTTCTCCATCGTCTTCGACGTCGTCTTCTTCATCCAGCACTTCTGTTTGTACAGAAAGAGACCGGGG 

TATGACCAGCTGAACTAGCACCCAGGGACCCAGTGTACCCAGCCTCTGGCCTCGTGCCCTGCTGGGGAAGGCCTC 

ACCCAGCGAAGGCCGGAGAAGCGGTTGGGCCCTGGCACACAGGGCTGGCTCAGTGTGCGGACAGAGGAGACCACT 

CTGCTCCTGGGGCCAGAGGCCATTCAATAGCCTGCCTTCGTCCGGGCCCCTCCTGGGCCTCCCCGGCCAGGCACG 

TGGCACCGTCGCCTXGACACCGCCATCTCTTTTCTTTAAGGCTTCAGGCAGCGCGCACAGGCTCTGGCAGCCGTC 

TCAGGCAGGACTGGGCACCAAGCTTGCAGCCGAAGGCCTTGCCCCAAACTACCAGCGTTTCTGCAAGCAGCTTGA 

AGGGCTGACCTTGCAGCCGGGTGAGCCAAGGGCACTTTGCTGCCACCGCTGCATTCCCAGAGATCAAGCAGCCCG 

GTGCCGTGGCCAGXGAACTCAGAGGTGCTGGTGGACGGGCTAGGACTTTGGGGTTAGGCCATGGGGCTCTTTCTC 

TGAAGGCCACTTTCCTGACGTACTCTCTGTACATAACTCAGCGTCCGTGACTGCAGXAACAGCCAGCCCTACCCA 

GAGTATTTCTGAGCCATGAGGGGCCCACCAGATTGGTTCTGAATTGGATTCATGCCCAGCGCATTAGCATAGTAA 

CTCCTTTCAGATTTTTTGGAGGGACGTTTGGAAGTGGCTTACTCTCTTCTGCCCTCTCXCCTACCTCCACCTTCT 

CAGATGAGCCCCATCXGAGCACAXCCAGCXGCXCCTTACCCAGCATCXGGAGXACAGGACATAGCTCTCXCCTGC 

XACCAGTCXGXGCCXXAGAGGTCGXXAGGCCTGCCAAACGGCGACCAGCTCCCCXGGAGCGAGGGCAGGCCCCTX 

CCCXCTCXXTCCCCAGACACCXACXTGAGACXCACCAAXTTCXGGCCXGTTCAGGAGCCXCAGATAAGXATTXGX 

ACTXGAGACCACCXCACACAAXCXGTAXGGGCCCAACCCXGAXCTCAAACCXCCXTCCCXCTGCCCAAAGCTGXC 

CTTCCTATGGCAGGAGGGGXGGGGGXCCCAGGACGXGCCXCATACAXGACXTGAGCXXGTCAGTCCACXGAGTTX 

CCXTCXACGAGAXCAACGCGAGGGGCCTGXAXCXTGAAXXAAAGCCXACXCGCTXCCXXXC 
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FIGURE 24 

MIRNWLTIFILFPLKLVEKCESSVSLTVPPWKLENGSSTNVSLTLRPPLNATLVITFEITFRSKNITILELPDE 
VWPPGVTNS SFQVTSQNVGQLTVYLHGNHSNQTGPRIRFLVIRSSAI S I INQVI GWI YFVAWS I SFYPQVIMNW 
RRKSVIGLSFDFVALNLTGFVAYSVFNIGLLWVPYIKEQFLLKYPNGVNPVNSNDVFFSLHAWLTLIIIVQCCL 
YERGGQRVSWPAIGFLVLAWLFAFVTMIVAAVGVITWLQFLFCFSYIKLAVTLVKYFPQAYMNFYYKSTEGWSIG 
NVLLDFTGGSFSLLQMFLQSYNNDQWTLIFGDPTKFGLGVFSIVFDWFFIQHFCLYRKRPGYDQLN 
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FIGURE 25 

CTTACAACTCCGCGCGGCCTCGGCCCCCTGCGCCGCCCGCCCCACAACAAAACTCAGCGCAGCGCTCCCGGGCGC 

CCGGTTCAGAGCGACCTGCGGCTCAGAGCGGAGGGGAGACTGACCGGAGCGCGGATCGGGACAGCGGCCGGGACA 

GCGGCGAGACGCGCGTGTGTGAGCGCGCCGGACCAAGCGGGCCCAGAAGCGGGTCTGCAGCCCAGAGGGCACCTT 

CTGCAAAC ATG TCTGTGGATCCCCTATCCAGCAAA6CTCTAAAGATCAAGCGAGAGCTGAGCGAGAACACGCCGC 

ACCTGTCGGACGAGGCGCTGATGGGGCTGTCGGTGCGCGAGCTGAACCGGCATCTGCGCGGGCTCTCCGCCGAGG 

AGGTGACACGGCTCAAGCAGCGGCGCCGCACACTCAAAAACCGTGGCTACGCCGCCAGCTGCCGCGTGAAGCGCG 

TGTGCCAGAAGGAGGAGCTGCAGAAGCAGAAGTCGGAGCXGGAGCGCGAGGTGGACAAGCTGGCGCGCGAGAACG 

CCGCCATGCGCCTGGAGCTCGACGCGCTGCGCGGCAAGTGCGAGGCGCTGCAGGGCTTCGCGCGCTCCGTGGCCG 

CCGCCCGCGGGCCCGCCACGCTCGTGGCGCCGGCCAGCGTCATCACCATCGTCAAGTCCACCCCGGGCTCGGGGT 

CTGGCCCCGCCCACGGCCCGGACCCCGCCCACGGCCCGGCCTCCTGCTCC3CAGTGCCCGCCCCCGCCATGCCTCA 

GCCACGCCCCTCCGGCCTCAGCTCCCTCCCCAAAGTGCCTGAGCGCCGCCTCTGTGCCCAGGTCCCATTTCTCTG 

CAGCACTGGCCCCTTGGTGCACACACATTCCCTTCGTGGGCCCTGTCTTCCTCTTGCAGCCCCCCAAACTGGGAC 

CGAATGACCCTGGGAAGGGGAAGTTGGGTAGGTTGGGGATGGGGCAGAGGTCTGGATCTGGGATCGCCCTTGGCT 

GAAAGTTTAGCCTTTTTAGATTGAGAGATACAGAGCCGGCTTAGAGAACAGCTGTTGGGGGAGAAGAGGGCACCC 

CTCATCTTGGAAACTGCTCTTATTGTGCCAATATGCCCTCCAAACCCTCCCAGGATTCAAAGCTAGGTTTGGCTG 

TCTGTGACTTACGGGACCGTCCTGCTGAGAAAXTGCACTGAAGAGATGCCCCCACCTCTGGTTGGGCCTGGGGGT 

GCCTGGCCTTCCGAAACTAAAAGAGTGGGTGGGAAGACTAGTGAAACCCAGTTCACGGATGGGGAAACAGGCCTG 

AGGTCACATTTCACTTAGTGGTTGTGTTGGGACCAAAACCTGGGTGTCCTCACTGCTGAGTCCAGCCATGGTTTT 

CAGGGGGACAGTGGACAGGGACTCAGAAATGTGGTGGGAGGGCCTCCCTGGCTTGGGAGACCGCTCTCTGCAAGG 

GAGGGGGAGAGAAGCAGAGGGAGAGAGAAGGTGACACGGATGGAAGAGTGGGAAGGAGCTGGCCTGGCTCAGCCC 

TAGGCTGTCCCTGCAGCCAGGGTGTCCGGGGGCTGGCCAGTCAGAGAAAGGGGGCCATGGACTGCTGTGGCAAAT 

AGGGAGACAAGGAGACAGACCCTGCAGTCCTACTACAGTCTGGAGTGGGGTCCTAAGAAGAAGGGTCCCACCTCA 

ACCCCTGTCAGTGTCCACTGTGGGGTGGGGGCTGACCCCTGCCTTTGATTGTCATTCTCCTGGGAAGCCCAGTCT 

CAGTCCCTCCCCCAACACTGTCCACACTGCCCCTCCCCACTGTTTATTTATTGCACGGATCTAAGTTATTCTCCC 

CAGCCAGAGCCCGAGCTCCTGCTCCCTGGGAAAAGTGGCGTATGGCCCTGAGCTGGGCXTTATATTATATATCTG 

CAAATAAATCACATTTTATCTTATATTTAGGGAAAGCCGGAGAGCAACAACAAAAAATGTTTAAGCCGGGCGCGG 

TGGCTCACATCTGTAAXCCCAGCACTTTGGGAGTCCAAGGAGGGGGATCGCTTGAGTCCAGGAGTTTGAGACCAG 

CCTGGACAACATGGTGAAACCCCGTCXCTACAAAAAATACAAAAATTAGCCATGCATGGTGGCTCATGCCTGTAG 

XCCCAGCXACTTGGGAGGCTGAGGCAGGAGGATCACTTAAGCCCAGAAGGCAGAGGTTGXAGTGAGCXGAGAXCG 

CACCACTGCACTCCAGCCXGGGCAACAXAGCAAAATCCXGXCTCAAAAAAAAAGXXAAAAAAXAXXGCCCGGCXC 

CTAGAAXTXAXTXAXTTCCXGACTTACAGCAAGCGAGTTATCGTCTXCXGTATTTTGTAGACXTTCXAAAXAAAG 

TCAAAXXCTXXCXXTXXCCACAGAAAAAAAAA 
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FIGURE 26 

MSVDPLSSKALKIKRELSENTPHLSDEALMGLSVRELNRHLRGLSAEEVTRLKQRRRTLKNRGYAASCRVKRVCQ 
KEELQKQKSELEREVDKLARENAAMRLELDALRGKCEALQGFARSVAAARGPATLVAPASVITIVKSTPGSGSGP 
AHGPDPAHGPASCS 
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FIGURE 27 

GCGGAGCGCGCGCTCCCAGCGAAAGCAGCAGGGCAGGGATCTGCGTTGGAGGAAGGGACTGCTCTGGTGCTAGAA 
TGCTGTGCGTCGGAAGGCTGGGCGGCTTGGGAGCCAGAGCAGCAGCTCTGCCGCCCCGCCGGGCGGGCCGGGGAA 
GCCTCGAAGCCGGGATCCGGGCCCGAAGGGTCAGCACCAGCTGGTCTCCCGTGGGCGCCGCCTTCAATGTCAAGC 
CCCAGGGCAGCCGCTTGGACCTGTTCGGCGAGCGGGCGCGTCXTTTTGGAGTTCCTGAGCTGAGTGCCCCAGAAG 
GATTXCATATTGCACAAGAAAAAGCCTTGAGAAAGACAGAATTGCTTGTGGACCGTGCATGTTCCACCCCACCTG 
GGCCCCAGACCGTGCTGATCTTCGATGAGCTCTCGGATTCCTTATGCAGAGTGGCCGACTTGGCTGATTTTGTGA 
AAATCGCTCACCCTGAGCCAGCATTCAGAGAAGCTGCGGAAGAAGCXTGTAGAAGTATTGGCACCATGGTAGAGA 
AGTTGAACACAAATGTGGATTTATATCAAAGTTTGCAAAAATTACTAGCTGATAAAAAACTTGTGGATTCCCTTG 
ATCCAGAAACAAGGCGAGTGGCTGAACTGTTTATGTTTGATTTTGAAATTAGTGGAATCCATCTAGACAAACAAA 
AGCGTAAAAGAGCAGTGGACCTCAATGTTAAAATCTTGGATTTGAGTAGTACATTTCTTATGGGAACCAATTTTC 
CCAACAAGATTGAGAAGCATCTCTTACCAGAACACATTCGTCGTAACTTTACATCTGCTGGGGATCATATCATAA 
TTGATGGTCTCCACGCAGAATCACCAGATGACTTGGTGCGAGAAGCTGCTTATAAAATTTTTCTTTATCCCAATG 
CTGGTCAATTGAAATGTTTAGAAGAATTGCTCAGCAGCAGAGATCTTCTGGCAAAGTTGGTGGGGTATTCCACGT 
TTTCTCACAGGGCTCTCCAAGGAACGATAGCTAAAAATCCAGAGACTGTCATGCAGTTCCTTGAAAAACTATCTG 
ACAAACTTTCTGAAAGAACTCTGAAAGATTTTGAGATGATACGAGGGATGAAAATGAAACTGAATGCTCAAAATT 
CCGAAGTAATGCCCXGGGACCCCCCTTACTACAGTGGTGTGATTCGTGCAGAAAGGTATAATATTGAGCCCAGCC 
TATATTGCCCGTTTTTCTCTCTTGGAGCATGCATGGAAGGCCTGAATATTTTGCTTAACAGACTGTTGGGGATTT 
CATTATATGCAGAGCAGCCTGCAAAAGGAGAGGTGTGGAGCGAAGATGTCCGAAAACTGGCTGTTGTTCATGAAT 
CTGAAGGATTGTTGGGGTACATTTACTGTGATTXTTTTCAGCGAGCAGACAAACCACATCAGGATTGCCATTTCA 
CXAXCCGXGGAGGCAGACTAAAGGAAGAXGGAGACTAXCAACTCCCACXTGTAGTXCTXAXGCXGAAXCTTCCCC 
GXTCCXCAAGGAGTTCXCCAACXTTGCTAACXCCXGGCATGAXGGAAAATCTTTTCCATGAAATGGGACAXGCCA 
XGCATXCAATGCXAGGACGXACXCGXTACCAACACGXCACXGGGACCAGGXGCCCTACXGAXXXXGCXGAGGXXC 
CTTCXAXTCTGATGGAGTACXXTGCAAAXGAXXAXCGAGTAGXXAACCAAXXTGCCAGACATTAXCAGACXGGAC 
AGCCACXGCCAAAAAAXATGGXGXCXCGTCXXXGXGAAXCXAAAAAGGTXXGXGCXGCAGCXGATATGCAACXTC 
AGGTCTTXXATGCCACXCTGGAXCAAATCTACCATGGGAAGCAXCCCCTGAGGAAXTCAACCACAGACAXTCTCA 
AGGAAACACAAGAGAAATXCXAXGGCCXACCATAXGTTCCAAAXACTGCCXGGCAGCXGCGAXXCAGCCACCXCG 
XGGGGXAXGGTGCTAGATAXXACXCTTACCXCAXGXCCAGAGCGGXCGCCXCCAXGGTXXGGAAGGAGXGXXXXC 
TACAGGAXCCXXTCAACAGGGCTGCCGGGGAGCGCXAXCGCAGGGAGATGCXGGCCCACGGTGGAGGCAGGGAGC 
CCAXGCTCATGGXTGAAGGXAXGCXXCAGAAGXGXCCTXCXGTTGAXGACTTCGXAAGTGCCCXCGTTTCCGACX 
XGGAXCXGGACXXCGAAACTXXCCXCAXGGATXCXGAATAaAAGAAACACXCXACACCTCTAAXCAAGGXCAXGX 
AGTAATGACTTTGTTATAAAXGCTACAGCXGTGAGAGCTTGTXXCXGATTGTTTCATTGXTCGCXXCTGXAAXTC 
XGAAAAACXXXAAACXGGXAGAACXXGGAAXAAAXAAXXXGTTTXAAXXAAAAAAAAAAAAAAAAAA 
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FIGURE 28 

MLCVGRLGGLGARAAALPPRRAGRGSLEAGIRARRVSTSWSPVGAAFNVKPQGSRLDLFGERARLFGVPELSAPE 
GFHIAQEKALRKTELLVDRACSTPPGPQTVLIFDELSDSLCRVADLADFVKIAHPEPAFREAAEEACRSIGTMVE 
KLNTNVDLYQSLQKLLADKKLVDSLDPETRRVAELFMFDFEISGIHLDKQKRKRAVDLNVKILDLSSTFLMGTNF 
PNKIEKHLLPEHIRRNFTSAGDHIIIDGLHAESPDDLVREAAYKIFLYPNAGQLKCLEELLSSRDLLAKLVGYST 
FSHRALQGTIAKNPETVMQFLEKLSDKLSERTLKDFEMIRGMKMKLNAQNSEVMPWDPPYYSGVIRAERYNIEPS 
LYCPFFSLGACMEGLNILLNRLLGISLYAEQPAKGEVWSEDVRKLAWHESEGLLGYIYCDFFQRADKPHQDCHF 
TIRGGRLKEDGDYQLPLWLMLNLPRSSRSSPTLLTPGMMENLFHEMGHAMHSMLGRTRYQHVXGTRCPTDFAEV 
PSILMEYFANDYRWNQFARHYQTGQPLPKNMVSRLCESKKVCAAADMQLQVFYATLDQIYHGKHPLRNSTTDIL 
KETQEKFYGLPYVPNTAWQLRFSHLVGYGARYYSYLMSRAVASMVWKECFLQDPFNRAAGERYRREMLAHGGGRE 
PMLMVEGMLQKCPSVDDFVSALVSDLDLDFETFLMDSE 
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FIGURE 29 

TTTGCAAATAGTAACGACAAAGATGATCAAGTTTTAAATTGCCATTTGGCAGTGAAGGTGCTATCCCCGGAAGAT 
GGAAAAGCAGATATTGTGAGAGCCGCTCAGGACTTTTGCCAGTTAGTAGCCCAGAAGCAAAGGAGACCCAAAGAT 
TTGGATGTAGATATGTTAGTTTACTCAGTTCAAATGGTTGTCCTGATCCTGATTTAGTACTGAAGTTCGGTCCTG 
TGGACAGCACACGAGGCTTTCTTACCTGGCACATCAGATTGACTGAGACTGTCTCTTTGCCTTCCCATCTAAACA 
TCAGTTATGAGGACTTTTTCTCTGCCCTTCGTCATTATGCAGCCTGTGAACAGCGTCTGGGAAAGTCGTGGTCAT 
TGGTTGCATAATTCCATTTGAGCTTATGGAGGAAAGGACCAAGTGACTCTGATTTTAGAAAGCACCTATGAAACC 
CTGTACACACCTATGAAACCCTGTACACACCXAGTTCATAATCTTCATAATTTATCAACAAACACAAAAAAGTGT 
CTTACTTGAGAGTGAGTGTGTGCGTGTGTGCGTGCACACATGTGCACGTTTGTATGTGTGGAAATAAACATAAAT 
GGGGACGTGTTGGAGAAGGAAATACATAGACCTACAACTTTGAGCATATAGCAGTGATGTTTTAGGAACTGAAAT 
GTCACACTTAATAAAGTCTTCAGCCCAGCTACTTCCCTGTTTTCGTGGGGAGAAGAGGGCCTGATTAGAACTGTT 
CTGGTTGTGTTTGGCGGGAGGGGAATAAATTTTGTTCAGTCCTTCTTAGTGACCAAACTTTAATTTTTAAGAATA 
ATATATTGACTTACTGAACTGAAGCATTCTGAGTTGAAAGGAGCTCCAGAGGAAAGGAGTTCTGTGTTGCTCACA 
TGTTAAAGCTTGCXCACCTTCAGAGCAGAGGAATACCTATCTACAGATATCCGCCCATTTTCATCTCTCTTCATT 
ATAGTCAAACAGTGTGACTTGAGAGTGTTG 
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FIGURE 30 

FANSNDKDDQVLNCHLAVKVLSPEDGKADIVRAAQDFCQLVAQKQRRPKDLDVDMLVYSVQMWLILI 
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FIGURE 31 

GAATTCTGCGGAGCCTGCGGGACGGCGGCGGGTTGGCCCGTAGGCAGCCGGGACAGTGTTGTACAGTGTTTTGGG 
CATGCACGTGATACXCACACAGTGGCTTCTGCTCACCAACAGATGAAGACAGATGCACCAACGAGGGTCTGGAAT 
GGTCTGGAGTGGTCTGGAAAGCAGGGTCAGATACCCCTGGAAAACTGAAGCCCGTGGAGCAATGATCTCTACAGG 
ACTGCTTCAAGGCTGATGGGAACCACCCTGTAGAGGTCCATCTGCGTTCAGACCCAGACGATGCCAGAGCTATGA 
CTGGGCCTGCAGGTGTGGCGCCGAGGGGAGATCAGCCATGGAGCAGCCACAGGAGGAAGCCCCTGAGGTCCGGGA 
AGAGGAGGAGAAAGAGGAAGTGGCAGAGGCAGAAGGAGCCCCAGAGCTCAATGGGGGACCACAGCATGCACTTCC 
TTCCAGCAGCTACACAGACCTCTCCCGGAGCTCCTCGCCACCCTCACTGCTGGACCAACTGCAGATGGGCTGXGA 
CGGGGCCTCATGCGGCAGCCTCAACATGGAGTGCCGGGTGTGCGGGGACAAGGCATCGGGCTTCCACTACGGTGT 
TCATGCATGTGAGGGGTGCAAGGGCTTCTTCCGTCGTACGATCCGCATGAAGCTGGAGTACGAGAAGTGTGAGCG 
CAGCTGCAAGATTCAGAAGAAGAACCGCAACAAGTGCCAGTACTGCCGCTTCCAGAAGTGCCTGGCACTGGGCAT 
GTCACACAACGCTATCCGTTTTGGTCGGATGCCGGAGGCTGAGAAGAGGAAGCTGGTGGCAGGGCTGACTGCAAA 
CGAGGGGAGCCAGTACAACCCACAGGTGGCCGACCTGAAGGCCTTCTCCAAGCACATCTACAATGCCTACCTGAA 
AAACXTCAACATGACCAAAAAGAAGGCCCGCAGCATCCTCACCGGCAAAGCCAGCCACACGGCGCCCTTTGTGAT 
CCACGACATCGAGACATTGTGGCAGGCAGAGAAGGGGCTGGTGTGGAAGCAGTTGGTGAATGGCCTGCCTCCCTA 
CAAGGAGATCAGCGTGCACGTCTTCTACCGCTGCCAGTGCACCACAGTGGAGACCGTGCGGGAGCTCACTGAGTT 
CGCCAAGAGCATCCCCAGCXTCAGCAGCCTCXTCCTCAACGACCAGGTTACCCTTCTCAAGTATGGCGTGCACGA 
GGCCAXCTXCGCCAXGCTGGCCXCXAXCGXCAACAAGGACGGGCTGCXGGXAGCCAACGGCAGXGGCTTTGXCAC 
CCGXGAGXXCCTGCGCAGCCTCCGCAAACCCTXCAGTGAXAXCATTGAGCCTAAGTTTGAAXXXGCTGTCAAGTT 
CAACGCCCTGGAACTXGAXGACAGXGACCXGGCCCXAXXCAXXGCGGCCAXCATXCXGXGXGGAGACCGGCCAGG 
CCXCAXGAACGTXCCACGGGXGGAGGCTAXCCAGGACACCATCCXGCGXGCCCXCGAAXTCCACCXGCAGGCCAA 
CCACCCXGAXGCCCAGXACCXCTXCCCCAAGCTGCXGCAGAAGATGGCXGACCTGCGGCAACXGGTCACCGAGCA 
CGCCCAGATGATGCAGCGGAXCAAGAAGACCGAAACCGAGACCTCGCTGCACCCXCXGCXCCAGGAGAXCXACAA 
GGACAXGXACTAACGGCGGCACCCAGGCCTCCCTGCAGACXCCAAXGGGGCCAGCACXGGAGGGGCCCACCCACA 
XGACTXXXCCAXTGACCAGCXCXCXXCCXGXCXXXGXXGTCTCCCXCXXXCTCAGXTCCXCXXXCTXTXCXAAXT 
CCTGTXGCXCXGTTTCXTCCTXXCXGTAGGXTTCXCXCTTCCCXXCXCCCTTCTCCCTXGCCCTCCCTTTCXCTC 
TCCXAXCCCCACGXCXGXCCTCCXXTCXXAXTCTGTGAGAXGXXTTGXAXXAXTXCACCAGCAGCAXAGAACAGG 
ACCXCXGCXXXTGCACACCXXXXCCCCAGGAGCAGAAGAGAGXGGGCCXGCCCTCXGCCCCATCAXXGCACCTGC 
AGGCTTAGGXCCTCACTTCXGTCTCCTGXCTXCAGAGCAAAAGACXXGAGCCATCCAAAGAAACACXAAGCXCTC 
XGGGCCXGGGXXCCAGGGAAGGCTAAGCATGGCCTGGACXGACXGCAGCCCCCTAXAGXCAXGGGGXCCCXGCXG 
CAAAGGACAGXGGCAGACCCCGGCAGXAGAGCCGAGAXGCCTCCCCAAGACTGTCAXTGCCCCXCCGATCGXGAG 
GCCACCCACXGACCCAAXGAXCCTCTCCAGCAGCACACCXCAGCCCCACTGACACCCAGTGTCCTTCCAXCTTCA 
CACXGGXXXGCCAGGCCAAXGXXGCXGAXGGCCCCXCCAGCACACACACAXAAGCACXGAAAXCACTXTACCXGC 
AGGCACCATGCACCXCCCXXCCCXCCCXGAGGCAGGXGAGAACCCAGAGAGAGGGGCCXGCAGGXGAGCAGGCAG 
GGCXGGGCCAGGTCTCCGGGGAGGCAGGGGTCCXGCAGGXCCTGGXGGGTCAGCCCAGCACCTCGCCCAGXGGGA 
GCXXCCCGGGAXAAACXGAGCCXGXTCAXXCTGAXGXCCATXXGXCCCAAXAGCXCXACXGCCCXCCCCXXCCCC 
TXXACXCAGCCCAGCXGGCCACCXAGAAGXCXCCCTGCACAGCCXCXAGXGXCCGGGGACCXTGXGGGACCAGTC 
CCACACCGCXGGXCCCTGCCCTCCCCTGCTCCCAGGTTGAGGTGCGCTCACCXCAGAGCAGGGCCAAAGCACAGC 
XGGGCAXGCCAXGXCXGAGCGGCGCAGAGCCCXCCAGGCCXGCAGGGGCAAGGGGCXGGCXGGAGXCXCAGAGCA 
CAGAGGXAGGAGAACTGGGGXXCAAGCCCAGGCTXCCXGGGTCCTGCCXGGTCCTCCCTCCCAAGGAGCCAXXCX 
AXGXGACXCXGGGTGGAAGTGCCCAGCCCCTGCCTGACGGXXXXXXXGATCACTCTCTGCTGGCAGGATTCTTCC 
CGCXCCCCACCXACCCAGCXGAXGGGGGTTGGGGTGCXXCXTTCAGCCAAGGCXAXGAAGGGACAGCXGCTGGGA 
CCCACCXCCCCCCXTCCCCGGCCACAXGCCGCGXCCCXGCCCCCACCCGGGTCXGGXGCXGAGGATACAGCXCXX 
CXCAGXGXCXGAACAAXCXCCAAAAXTGAAATGTATAXXXXXGCXAGGAGCCCCAGCXXCCTGTGTXXXTAATAX 
AAAXAGXGXACACAGACXGACGAAACXXXAAAXAAAXGGGAAXXAAATAXXXAAAAAAAAAAGCGGCCGCGAAXX 
C 
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FIGURE 32 

MEQPQEEAPEVREEEEKEEVAEAEGAPELNGGPQHALPSSSYTDLSRSSSPPSLLDQLQMGCDGASCGSLNMECR 
VCGDKASGFHYGVHACEGCKGFFRRTIRMKLEYEKCERSCKIQKKNRNKCQYCRFQKCLALGMSHNAIRFGRMPE 
AEKRKLVAGLTANEGSQYNPQVADLKAFSKHIYNAYLKNFNMTKKKARSILTGKASHTAPFVIHDIETLWQAEKG 
LVWKQLVNGLPPYKEISVHVFYRCQCTTVETVRELTEFAKSIPSFSSLFLNDQVTLLKYGVHEAIFAMLASIVNK 
DGLLVANGSGFVTREFLRSLRKPFSDIIEPKFEFAVKFNALELDDSDLALFIAAIILCGDRPGLMNVPRVEAIQD 
TILRALEFHLQANHPDAQYLFPKLLQKMADLRQLVTEHAQMMQRIKKTETETSLHPLLQEIYKDMY 
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FIGURE 33A 

GGACACGGAGCCGCGAGGAGACAGCTGAGGCCCGCGGAGACCAGGGGGTGAAGCCTGGAGACCCTCTTGCCCTGG 
CCTAGCTGCAGGCCCCCGGGATGCTTTGGGCATGTCCTCTGGAGCCCCACAGAAGAGCAGCCCAATGGCCAGTGG 
AGCTGAGGAGACCCCAGGCTTCCTGGACACGCTCCTGCAAGACTTCCCAGCCCTGCTGAACCCAGAGGACCCTCT 
GCCATGGAAGGCCCCAGGGACGGTGCTCAGCCAGGAGGAGGTGGAGGGCGAGCTGGCTGAGCTGGCCATGGGCTT 
TCTGGGCAGCAGGAAGGCCCCGCCACCACTTGCTGCTGCTCTGGCCCACGAAGCAGTTTCACAGCTGCTACAGAC 
AGACCTTTCCGAATTCAGGAAGTTGCCCAGGGAGGAAGAAGAAGAGGAGGAGGACGATGACGAGGAGGAAAAGGC 
CCCTGTGACCTTGCTGGATGCCCAAAGCCTGGCACAGAGTTTCTTTAACCGCCTTTGGGAAGTCGCCGGCCAGTG 
GCAGAAGCAGGTGCCATTGGCTGCCCGGGCCTCACAGCGGCAGTGGCTGGTCTCCATCCACGCCATCCGGAACAC 
TCGCCGCAAGATGGAGGACCGGCACGTGTCCCTCCCTTCCTTCAACCAGCTCTTCGGCTTGTCTGACCCTGTGAA 
CCGCGCCTACTTTGCTGTGTTTGATGGTCACGGAGGCGTGGATGCTGCGAGGTACGCCGCTGTCCACGTGCACAC 
CAACGCTGCCCGCCAGCCAGAGCTGCCCACAGACCCTGAGGGAGCCCTCAGAGAAGCCTTCCGGCGCACCGACCA 
GATGTTTCTCAGGAAAGCCAAGCGAGAGCGGCTGCAGAGCGGCACCACAGGTGTGTGTGCGCTCATTGCAGGAGC 
GACCCTGCACGTCGCCTGGCTCGGGGATTCCCAGGTCATTTTGGTACAGCAGGGACAGGTGGTGAAGCTGATGGA 
GCCACACAGACCAGAACGGCAGGATGAGAAGGCGCGCATTGAAGCATTGGGTGGCTTTGTGTCTCACATGGACTG 
CTGGAGAGTCAACGGGACCCTGGCCGTCTCCAGAGCCATCGGGGATGTCTTCCAGAAGCCCTACGTGTCTGGGGA 
GGCCGATGCAGCXXCCCGGGCGCTGACGGGCTCCGAGGACTACCTGCTGCTTGCCTGTGATGGCTTCTTXGACGT 
CGTACCCCACCAGGAAGTTGTTGGCCTGGTCCAGAGCCACCTGACCAGGCAGCAGGGCAGCGGGCTCCGTGTCGC 
CGAGGAGCTGGTGGCTGCGGCCCGGGAGCGGGGCTCCCACGACAACATCACGGTCATGGTGGTCTTCCTCAGGGA 
CCCCCAAGAGCTGCTGGAGGGCGGGAACCAGGGAGAAGGGGACCCCCAGGCAGAAGGGAGGAGGCAGGACTTGCC 
CTCCAGCCXTCCAGAACCTGAGACCCAGGCTCCACCAAGAAGCTAGGTGGTTTCCAGGCCCCTGCCCTCCCCTTC 
CTCCCATCCTTGTCCXXCTCTCCCXCAGAAGCCTCAGGACCCAACAGGXGGCAGGCAGTGGACAGGGTGCCCGCC 
CCACAGTGCXTTCCCCAGCACCCCAGAGCCAGXCGGGACACCCCCCGCAGCCCGTCCXGGXGGCXGXGGAACXGC 
ACXGGGXGGCGGGCAGAXGGXGGAAGGCAGCXXAGGAGACCTCACCAAAGAGAAGATGGACCGGCXCTTGCTCCC 
AGCXCCTAXTAGGCCCGGGGTGGGACCAGAGGTCAXAGGXGCCCAACGGCAGCCAAACCAAAGACACTGGXGTGC 
AXGGGGCAGCAXGGTTGXGCACGXGGGACCCTGGGGCGGACCCAGGAGCCAAACTCTXGAAGCACCCCCTGGGTC 
AGGCCCAGCAGCGGAGXGGCCAGCCCCAGXXXCCCATTGCXCCTCTCTGCGGCCAGGGCCAGGXGGGTTCAXAXX 
TACAGATATGCCCAGCCAGXCCXGGXCGGCCACACCAGTGXCCCAAAGAGGAGAGCGCAGCAGAGCCAGGGGTCT 
GTXCTGTAGCAGCCACCCCCCXGCCCCCACTCCAGGGCAGCCAXGATGTGCTXGGCCCACCAGGGCCXXCCGGGC 
XGCTCXCXXCCCTGAGCCCGGAACCGGCGACGCACATGXGXCXTTTGXXGGTGTGTTXGXTXXXTTCCAGGGAGG 
XCTAAXXCCGAAGCAGXATTCCAGGXXXTCTCTTTGXXXXAXCAGTGCCAAGATGACCXGTXGTGTCATAXAATT 
TAAGCAGAGCTXAGCAXXXAXTTTATTCTXTAGAAAACTTAAGXATTTACTXXTXXAAAGCTATTTTTCAAGGAA 
CCTXXXXXTGCAGXAXTATTGAATXXAXXXXCTAAAXCAGGAXXGAAACAGGAACTTTXCCAGGXGGTGXXAATA 
AGCCAXXCAAGTGCCXXACACAGCTXXGAAGAAACXAGGACXGCAGXGGGCTCGGATAGGCCCATXGAGGXXXTT 
AGAAAAGCAGGAXTTGTTTTGTTAGGGAGGCAXGAXTTTGGTGAGATCXTTCXGGAAGAGTTTTCCGCCXCTXTG 
XGAXGCXGAACACCCCCAAGGXXCXCCCCXCCCCCCGCXGCCCAGGXGACXGGCAGGAGCXGCGACXGCCACGTA 
GXGTTGCCXGGGCCCGACAGCGGGGCTCTGGGCATCCCGGGXGACCTXGGCCCAXCXGCCTGCAXXCCCACCCCC 
XXGGGCCXGGCXGGATCCCAGGCAGAGGGACCTTGCTGCTGTGTGATTGGAACAXTCCCAAATATCTTGTGAAXX 
TGXAAXCAAATTGGXCXCATTGGGAAAGACXCXTAAXXAAGAGGCTCAGGCAAGCACAGAGGCAGCCCGXGGGXC 
TCTGXCTCAGXCTGGAGGCAGCAGGGATGCXGCXGGGAGTCCAXGGCACAGGCCACAGCCCCTCACCTTGCCGCG 
GXGGCXGGCAGCACGCCTGCCXTGCXCXGCCCCATGCCCTGAACAGGCAXGAGAGCXCCACGXCCCCXAGXGCAC 
CCTGAGAGGGGGCTCACAAGXGACCGAXCCTGGGXGCCTCAGGGAGCXCACTGAGGGCGTGCAAAGTTGAAAGXG 
GCAAGGCTGGGGGAGGGXGTCGGGTAGAGGGAAGAGGGCAGGGGGCTAGGGGAGGACTCAGAGGCCAXCTGCAGG 
GCCAAGCCACAGGAAGGGCXGAGCTGGAGGTGGGCAGGGCXGCTCCAGGCAGGXCAGAGCAGXGCAGGGGGAGGA 
GAGGAGAAAGGGAGGAAGCTGGGCTGTGTGGTCCCCATGAAGGCAXXCAGAGTCCACCXGCAGACAGCGAGAGCC 
CCAGGAAGGTTXGCACAGCXGXGCCCCAAGCACCTXGGCCXCCTCXCAGCXCGCCGAGGAGGCACGCTAGAGCCG 
CCTTCCCGGXGGGAGCCCXCTGXCCCACAGGGAGCGGGGAGCCAGCTXXGCXGGGGCCCTACCXGCAXGCCCAGC 
CTTACCCCXCAXTCTCACAGCACAGATGAGGXTGAGACCATGCAGXCAATGCATTGCTTAAGGXCTCTTATTTAC 
AAAAAAAAACCXXAAACAXAGXCGCXGXCAXXCAGACAXXCAGAGAAXGGXTGGCCACAAACAAXGACCAAGXAX 
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FIGURE 33B 

TGCTTGGCTTAACTTGAAGGCCTGCTGTCTCCTTCTGGGGGTCAGGGACGCAGCTCCACCCTCACCACTAGCCCA 
CCCTGCCCGTGGGCATAACCTTGACGAAGAGAGAGAATGATTGGCATCTGCTTTTCTCTTTTCTTTGCTAATAAT 
TCTGTTCCTGGCTGCCGAGAGTGAAGTTTCACCATGTGGAGGTTTGGCTCCTATCACCTGGTGGTCTGATTCATA 
CCCTAGCCTGAGGCTCCACTGGAAGATCTCGCAGCCTCAGTGTATGGGAAACCCTTTCCCCAGGCTTGTCCCAGC 
ACTGCCGCTCCCCACCCCTGAGCCAGGACCCCAGAGGATGGCCATGCCCCGTGCCTGGCAGAGGTCTGGTGCCAG 
CACTGGGAGCTGCTCCGCCCTTGCCTTGGGGCCGAGGGAGCCCTCGTCCACCCCTGCACAGCAGCTGGGCACAGA 
GGAGCGCTCTTCCATCTTGACCAGGACTGCACCAAGAAGCACCAGGTGTCTTCAGCCTCCAACCTCCGGGGCGAC 
CTTCTCTTCCAGCCACAGTCCCATGAGGGCCCCTAGCCAGGGACACTGGTCTGTAAATTGTAATCCTTTCTCCAG 
CCCAGCTCTCCACTTGTTCCTTGTGTGAGCTGAGCAGGCAGTGCACCTCTGAGTGTCCCTTTTGTAAGGCCCAGG 
GGTTGCACTGAGTCTGCAGAGGCCGCGACCTCCTAGAACGCTGTGGGTGCAAGTGAGCCGGCGTGTCCTGGGGAG 
ATGCTGCCAGCACACAGGGGCCCTCCTGCTGCCAGCAGGTTGGGGTGGTTAAGTCTTATTAGTGTCTATTCTTAA 
AATTAAGTGGGCTGGAGAAGAATGGAGCTCCACATGCCAGCACCGTATATGGAATACAAAAGCTGGGGAAGCAGG 
GCCTGCCTTACAGGTGTGGCTGACTCTGAGCCCAGGCCTGCAGGGGTGGAGGGCAGTCCCTCAGAATCCCAGAGG 
CAGTCCCAGCCTCAGAACCCAGGATAGGAAATGGGTGTGTTTAGTGGGGAAAGGGACGGGGTGCAGACGGCAGGG 
CCAGTATGGGGCCCCCTCCCTCTCCTCTCCTCTCCTATGGTGAGCCCAGCGTGGGCACCGGGCCGTCTCAGCCGT 
GTTCCCAGGGCTGGGAGGACAGCTCTGGCCCTTCTTAGGCCTAGCCTCGTCCCAAGCTAAATGTAAGCCAGTTGG 
GCTGTGTTAAAGGAAGCAGTGTTTTTGGTTTGATTCTGCCTCTGTAGCTCAAGGGGGGCAGCCCCCAGAGTCCTG 
TGCATTCTGCCAAGGCTCCATAGCTTTGCCAAATGCACGGAGCTCTGCCATTCCGGTGCAGTGCAGGCCTTGCGA 
AGGGTTTATCTGCGTTCGTCTCGGTGGGCTTCTCCTGCATGGGAGTTGTGTTCCTGTGCAAGGGGGAGCTTTGCT 
CAGGACAGGATGACTGTCTTCCCTATTCTTAGGGACAAGTCCCAAGATGCCAGAAAGGCAGTCTCCCAAGGACCC 
ACCATGCAGAAGTGTCAATAAACCACAAGTTCTG 
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FIGURE 34 

MSSGAPQKSSPMASGAEETPGFLDTLLQDFPALLNPEDPLPWKAPGTVLSQEEVEGEIiAELAMGFLGSRKAPPPL 
AAALAHEAVSQLLQTDLSEFRKLPREEEEEEEDDDEEEKAPVTLLDAQSLAQSFFNRLWEVAGQWQKQVPLAARA 
SQRQWLVSIHAIRNTRRKMEDRHVSLPSFNQLFGLSDPVNRAYFAVFDGHGGVDAARYAAVHVHTNAARQPELPT 
DPEGALREAFRRTDQMFLRKAKRERLQSGTTGVCALIAGATLHVAWLGDSQVILVQQGQWKLMEPHRPERQDEK 
ARIEALGGFVSHMDCWRVNGTLAVSRAIGDVFQKPYVSGEADAASRALTGSEDYLLLACDGFFDWPHQEWGLV 
QSHLTRQQGSGLRVAEELVAAARERGSHDNITVMWFLRDPQELLEGGNQGEGDPQAEGRRQDLPSSLPEPETQA 
PPRS 
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FIGURE 35 

ATG GCGGCGGCCGTAGCGGCTCCACTCGCCGCC66GGGTGAGGAGGCGGCAGCCACGACCTCCGTGCCCGGGTCT 

CCAGGTCTGCCGGGGAGACGCAGTGCAGAGCGGGCCCTAGAGGACGCCGTGGCCACCGGGACCCTGAACCTGTCT 

AACCGGCGCTTGAAGCACTTCCCCCGGGGCGCGGCCCGTAGCTACGACCTGTCAGACATCACCCAGGCXGACCTG 

TCCCGGAACCGGTTTCCCGAGGTGCCCGAGGCGGCGTGCCAGCTGGTGTCCCTGGAGGGCCTGAGCCTCTACCAC 

AATTGCCTGAGATGCCTGAACCCAGCCTTGGGGAATCTCACAGCCCTCACCTACCTCAACCTCAGCCGAAACCAG 

CTGTCGCTGCTGCCACCCTACATCTGCCAGCTGCCCCTGAGGGTCCTCATCGTCAGCAACAACAAGCTGGGAGCC 

CTGCCCCCXGACATCGGCACCCTGGGAAGCCTGCGACAGCTTGACGTGAGCAGCAACGAGCTCCAATCCCTGCCC 

TCGGAACTGTGTGGCCTCTCTTCCCTGCGGGACCTCAATGTCCGGAGGAACCAGCTCAGTACGCTGCCCGAAGAG 

CTGGGGGACCTCCCTCTGGTCCGCCTGGATTTCTCCTGTAACCGCGTCTCCCGAATCCCAGTCTCCTTCTGCCGC 

CTGAGGCACCTGCAGGTCATTCTGCTGGACAGCAACCCTCTGCAGAGTCCACCTGCCCAGGTCTGCCTGAAGGGG 

2UVACTTCACATCTTCAAGTATTTGTCCACAGAGGCCGGGCAGCGTGGGTCGGCCCTGGGGGACCTGGCCCCTTCT 

CGGCCCCCGAGTTTCAGTCCCTGCCCTGCAGAGGATCTATTTCCGGGACATCGGTACGATGGTGGGCTGGACTCA 

GGCTXCCACAGCGTTGATAGTGGCAGCAAGAGGTGGTCTGGAAATGAGTCAACAGATGAATXTTCAGAGCTGTCA 

XXCCGGAXCXCAGAGCXGGCCCGGGAGCCCCGGGGGCCCAGAGAACGCAAGGAGGAXGGCXCAGCGGACGGAGAC 

CCXGTGCAGATTGACXXCATCGACAGCCAXGTCCCCGGGGAGGAXGAAGAGCGAGGCACXGTGGAGGAGCAGCGA 

CCACCCGAAXXAAGCCCXGGGGCAGGGGACAGGGAGAGGGCACCAAACAGCAGGCGGGAGGAGCCGGCAGGGGAG 

GAGCGGCGGCGCCCGGACACCXXGCAGCXGXGGCAGGAGCGGGAACGGCGGCAGCAGCAGCAGAGCGGGGCGXGG 

GGGGCCCCGAGGAAGGATAGCCXCTXGAAGCCAGGGCTCAGGGCTGTTGXGGGAGGGGCCGCCGCCGXGXCCACX 

CAAGCCAXGCACAACGGCXCGCCXAAGXCCAGXGCCXCCCAAGCAGGGGGCXGCAGCGGGGCAGGGAGCCCCGCC 

CCTGCCCCXGCCXCCCAAGAGCCCCXXCCCAXAGCTGGACCAGCGACAGCACCXGCXCCACGGCCACXXGGCXCC 

AXXCAGAGACCAAACAGCXXCCXCXXCCGXXCCXCCXCTCAGAGTGGCXCAGGCCCXXCCXCACCAGACXCTGTC 

CXGAGACCTCGGCGGTACCCCCAGGXTCCAGAXGAGAAGGACXXAAXGACXCAGCXGCGCCAGGXCCXXGAGXCC 

CGGCXGCAGCGGCCCCXGCCXGAGGACCTGGCCGAGGCXCXGGCCAGXGGGGXCAXCCXGXGCCAGCTGGCCAAC 

CAGCXACGGCCGCGCTCCGXGCCCXXCAXCCAXGXGCCCXCCCCXGCXGXGCCAAAACTCAGXGCCCXCAAGGCX 

CGGAAGAATGTGGAGAGTXXXCXAGAAGCCXGXCGAAAAAXGGGGGTGCCXGAGGCTGACCXGXGCXCGCCCXCG 

GAXCXCCXCCAGGGCACXGCCCGGGGGCXGCGGACCGCGCXGGAGGCCGTGAAGCGGGXGGGGGGCAAGGCCCXA 

CCGCCCCXCXGGCCCCCCXCXGGXCXGGGCGGCXXCGXCGXCXTCXACGTGGXCCTCAXGCXGCXGCXCXAXGXC 

ACCTACACXCGGCXCCTGGAXCCCCGTXCCCCCCAGGTGGCCXGGGAGGXGGCCCCCXCGAGGATGACXCCACXA 

GCGCCCXGGGACCCCAAGXAXGAAGCCAAAGCAGGACCXCGGCCGGXGXGGGXGAGXXGGGGGCAAACCXGXGGG 

ACXGGCXGGGGXGCXCAGGGAGCXGTGCGGXGGCCXGAGGCXCCAGXGCXCXGXCCXCCXCACCCTAGGGGGCCA 

ACXGXAGCXCAGGAGCCXCGXTCXCAGGCCGGACGCTGXGXCACCCCTCATTCTGGCCGCXGTAXGAAGCAGCCX 

CGGGCAGGGGXCXCAGGCCCGXGGCCCCXGCCACAGGGCACXGGAAXGGACAGCAGGCGCCCCCAGATGCAGGGX 

XCCCGGXGGXGXGCXGXGAAGAXGXCXXCCXCXCGGACCCXCXGCXGCCCCGGGGGCAGCGXGXXCCCCXGXACC 

XGTCCAAGGCCCCCCAGCAGATGATGGGCTCCCXGAAACXGCTGCCGCCGCCCCCCAXCAXGTCXGCCAGGGTGC 

XCCCCCGCCCAXCACCCXCCCGGGGCCCCXCCACXGCCXGGCXCAGCGGGCCGGAGCXGAXCGCXCXCACXGGCC 

XGCXGCAGAXGAGCCAGGGGGAGCCXAGGCCCAGCXCCXCCGCGGXTGGCCCCCCAGACCAXACCXCXGACCCAC 

CCAGCCCCXGXGGXAGCCCCAGCAGTTCTCAGGGTGCTGACCXCXCXCTCCCACAGACCCCAGACACCCAXXGXC 

CAXAGCCXXCXCAGGGCAGAGXGGGCXGGXXGXGXXGACAAXAAAACAGXGXXGGXXXGCA 
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FIGURE 36 

MAAAVAAPLAAGGEEAAATTSVPGSPGLPGRRSAERALEDAVATGTLNLSNRRLKHFPRGAARSYDLSDITQADL 
SRNRFPEVPEAACQLVSLEGLSLYHNCLRCLNPALGNLTALTYLNLSRNQLSLLPPYICQLPLRVLIVSNNKLGA 
LPPDIGTLGSLRQLDVSSNELQSLPSELCGLSSLRDLNVRRNQLSTLPEELGDLPLVRLDFSCNRVSRIPVSFCR 
LRHLQVILLDSNPLQSPPAQVCLKGKLHIFKYLSTEAGQRGSALGDLAPSRPPSFSPCPAEDLFPGHRYDGGLDS 
GFHSVDSGSKRWSGNESTDEFSELSFRISELAREPRGPRERKEDGSADGDPVQIDFIDSHVPGEDEERGTVEEQR 
PPELSPGAGDRERAPNSRREEPAGEERRRPDTLQLWQERERRQQQQSGAWGAPRKDSLLKPGLRAWGGAAAVST 
QAMHNGSPKSSASQAGGCSGAGSPAPAPASQEPLPIAGPATAPAPRPLGSIQRPNSFLFRSSSQSGSGPSSPDSV 
LRPRRYPQVPDEKDLMTQLRQVLESRLQRPLPEDLAEALASGVILCQLANQLRPRSVPFIHVPSPAVPKLSALKA 
RKNVESFLEACRKMGVPEADLCSPSDLLQGTARGLRTALEAVKRVGGKALPPLWPPSGLGGFWFYWLMLLLYV 
TYTRLLDPRSPQVAWEVAPSRMTPLAPWDPKYEAKAGPRPVWVSWGQTCGTGWGAQGAVRWPEAPVLCPPHPRGP 
XVAQEPRSQAGRCVTPHSGRCMKQPRAGVSGPWPLPQGXGMDSRRPQMQGSRWCAVKMSSSRTLCCPGGSVFPCT 
CPRPPSR 
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FIGURE 37 

GCGCGGCCC ATG TCCTCCGCGCCGCGCTCGCCCACCCCGCGGCCCCGCAGGATGAAGj^GGACGAGTCGTTCCTG 
GGCAAGCTGGGCGGCACCCTGGCCAGGAAGCGGAGGGCGCGCGAGGTGAGTGACCTGCAGGAAGAAGGCAAGAAT 
GCCATCAACTCACCGATGTCCCCCGCCCTGGTGGATGTTCACCCTGAAGACACCCAGCTTGAGGAGAACGAGGAG 
CGCACGATGATTGACCCCACTTCCAAGGAAGACCCCAAGTTCAAGGAACTGGTCAAGGTCCTCCTCGACTGGATT 
AATGACGTGCTGGTGGAGGAGAGGATCATTGTGAAGCAGCTGGAGGAAGACCTGTATGACGGCCAGGTGCTGCAG 
AAGCTCTTGGAAAAACTGGCAGGGTGCAAGCTGAATGTGGCTGAGGTGACACAGTCCGAAATAGGGCAGAAACAG 
AAGCTGCAGACGGTGCTGGAAGCAGTACATGACCTGCTGCGGCCCCGAGGCTGGGCGCTCCGGTGGAGCGTGGAC 
TCAATTCACGGGAAGAACCTGGTGGCCATCCTCCACCTGCTGGTCTCTCTGGCCATGCACTTCAGGGCCCCCATC 
CGCCTTCCTGAGCATGTAACGGTGCAGGTGGTGGTCGTGCGGAAACGGGAAGGCCTGCTGCATTCCAGCCACATC 
TCGGAGGAGCTGACCACAACTACAGAGATGAXGATGGGCCGGTTCGAGCGGGATGCCTTCGACACGCTGTTCGAC 
CACGCCCCGGATAAGCTCAGCGTGGTGAAGAAGTCTCTCATCACTTTTGTGAACAAGCACCTGAACAAGCTGAAT 
TTGGAGGTGACGGAACTGGAGACCCAGTTTGCAGATGGCGTGTACCTGGTTCTGCTCATGGGCCTTCTGGAAGAC 
TACTXTGTTCCTCTCCACCACTTCTACCTGACTCCGGAAAGCTTCGATCAGAAGGTCCACAATGTGTCCTTCGCC 
TTTGAGCTGATGCTGGACGGAGGCCXCAAGAAACCCAAGGCTCGTCCTGAAGACGTGGTTAACTTGGACCTCAAA 
TCCACCCTGAGGGTTCTTTACAACCTGTTCACCAAGTACAAGAACGTGGAGTGACGGGGGAGCTGTGGATGGTGG 
CAGGAGTGTCCCAGCAAGAAAGGCGGCATCCGTCTGTGCCCTGTGCCTTTCCAGGGAGCCAGGCGCCATGGGCTT 
CTGGTCCAAGCTGTGTTGACTGTCATCCCCACCCCACCCCTACCTCACGCCTGCCCCACCCCCTGCCTCTTTTGG 
TTGTTGTTCTTAATCTCCTCTCCATGTAGTTCCCAGTGGGCAAGAGCCTTTGAAAATGCAGGATTCTAAACACTC 
GTGCTTGCGTTTGAAGCCTCGCGTCACTCAGTCGCGTGGGATGATGAGTCCGTTGTTGTCTGCCTTGGCCGAAAG 
ATGAAAAAAAGCCTGAACCCCAACCCCCAGCTGGTTGAGAGCACCCTGCATTCTGCCTCATGGTGCAGTTAGCGA 
TCACAGGCCTTCAGAAGTACAACAXCAGCTCAGCAGGAACGCCGGCTCCCCGAGGACCCAGGCTTGACGATTCAC 
CGGGGATCXCCTGGGCXGGGCXCTCCTXGGAAGXGAGGCCXTXXAXXAAAAATAAAAGGGXTTTGCAGXXXGAAA 
AAAAAAAAAAAAAAAAAAAA 



wo 2004/041170 



PCT/US2003/034312 



41/2825 

FIGURE 38 

MSSAPRSPTPRPRRMKKDESFLGKLGGXLARKRRAREVSDLQEEGKNAINSPMSPALVDVHPEDTQLEENEERTM 

IDPTSKEDPKFKELVKVLLDWINDVLVEERIIVKQLEEDLYDGQVIiQKLLEKLAGCKLNVAEVTQSEIGQKQKLQ 

TVLEAVHDLLRPRGWALRWSVDSIHGKNLVAILHLLVSLAMHFRAPIRLPEHVTVQWWRKREGLLHSSHISEE 

LTTTTEMMMGRFERDAFDTLFDHAPDKLSWKKSLITFVNKHLNKLNLEVTELETQFADGVYLVLLM^^ 

PLHHFYLTPESFDQKVHNVSFAFELMLDGGLKKPKARPEDWNLDLKSTLRVLyiSILFTKyKNVE 



wo 2004/041170 



PCT/US2003/034312 



42/2825 

FIGURE 39 

GCTACGCGGGCCACGCTGCTGGCTGGCCTGACCTAGGCGCGCGGGGTCGGGCGGCCGCGCGGGCGGGCTGAGTGA 
GCAAGACAAGACACTCAAGAAGAGCGAGCTGCGCCTGGGTCCCGGCCAGGCTTGCACGCAGAGGCGGGCGGCAGA 

CGGTGCCCGGCGGAATCTCCTGAGCTCCGCCGCCCAGCTCTGGTGCCAGCGCCCAGTGGCCGCCGCTTCGAAAGT 
GACTGGTGCCTCGCCGCCTCCTCTCGGTGCGGGACCATGAAGCTGCTGCCGTCGGTGGTGCTGAAGCTCTTTCTG 
GCTGCAGTTCTCTCGGCACTGGTGACTGGCGAGAGCCXGGAGCGGCTTCGGAGAGGGCTAGCTGCTGGAACCAGC 
AACCCGGACCCTCCCACTGTATCCACGGACCAGCTGCTACCCCTAGGAGGCGGCCGGGACCGGAAAGTCCGTGAC 
TTGCAAGAGGCAGATCTGGACCTTTTGAGAGTCACTTTATCCTCCAAGCCACAAGCACTGGCCACACCAAACAAG 
GAGGAGCACGGGAAAAGAAAGAAGAAAGGCAAGGGGCTAGGGAAGAAGAGGGACCCATGTCTTCGGAAATACAAG 
GACTTCTGCATCCATGGAGAATGCAAATATGTGAAGGAGCTCCGGGCTCCCTCCTGCATCTGCCACCCGGGTTAC 
CATGGAGAGAGGTGTCATGGGCTGAGCCTCCCAGTGGAAAATCGCTTATATACCTATGACCACACAACCATCCTG 
GCCGTGGTGGCTGTGGXGCTGTCATCTGTCTGTCTGCTGGTCATCGTGGGGCTTCTCATGTTTAGGTACCATAGG 
AGAGGAGGTTATGATGTGGAAAATGAAGAGAAAGTGAAGTTGGGCATGACTAAXTCCCACTGAGAGAGACTTGTG 
CTCAAGGAATCGGCTGGGGACTGCTACCTCTGAGAAGACACAAGGXGATTTCAGACTGCAGAGGGGAAAGACTTC 
CAXCXAGXCACAAAGACXCCXXCGXCCCCAGXXGCCGXCXAGGAXXGGGCCXCCCAXAAXXGCXXXGCCAAAAXA 
CCAGAGCCTTCAAGTGCCAAACAGAGTATGTCCGATGGTATCTGGGXAAGAAGAAAGCAAAAGCAAGGGACCTXC 
AXGCCCXXCXGAXXCCCCXCCACCAAACCCCACXXCCCCXCAXAAGXXIGXXXAAACACXXATCXXCTGGAXXAG 
AAXGCCGGXXAAAXXCCAXAXGCXCCAGGAXCXXXGACXGAAAAAAAAAAAGAAGAAGAAGAAGGAGAGCAAGAA 
GGAAAGAXXXGXGAACXGGAAGAAAGCAACAAAGAXTGAGAAGCCAXGTACTCAAGXACCACCAAGGGAXCXGCC 
AXTGGGACCCXCCAGXGCXGGAXXTGAXGAGXXAACXGTGAAAXACCACAAGCCXGAGAACXGAAXXTXGGGACX 
TCXACCCAGAXGGAAAAATAACAACXAXXTXXGXXGTXGXXGXXXGXAAAXGCCXCXXAAATXATAXAXXXAXXX 
XAXXCXAXGXAXGXXAAXXXAXXXAGXXXXXAACAAXCXAACAAXAAXAXXTCAAGXGCCXAGACXGXXACXXXG 
GCAATTXCCTGGCCCXCCACTCCTCATCCCCACAATCTGGCXXAGXGCCACCCACCTTXGCCACAAAGCTAGGAX 
GGTXCTGTGACCCAXCXGTAGTAATXTATXGXCXGXCXACAXXTCXGCAGATCTXCCGTGGTCAGAGTGCCACXG 
CGGGAGCXCXGXAXGGXCAGGAXGXAGGGGXXAACTXGGXCAGAGCCACXCXAXGAGXXGGACXXCAGXCXXGCC 
XAGGCGAXXXTGTCTACCAXXTGTGTTXTGAAAGCCCAAGGXGCTGATGTCAAAGXGXAACAGATAXCAGTGTCT 
CCCCGXGXCCXCXCCCXGCCAAGTCTCAGAAGAGGXXGGGCXXCCAXGCCTGTAGCTTXCCXGGTCCCXCACCCC 
CAXGGCCCCAGGCCACAGCGXGGGAACXCACXXXCCCXXGXGXCAAGACAXXXCXCXAACXCCXGCCAXXCXXCX 
GGTGCTACXCCAXGCAGGGGXCAGXGCAGCAGAGGACAGTCTGGAGAAGGXAXTAGCAAAGCAAAAGGCXGAGAA 
GGAACAGGGAACAXTGGAGCXGACXGXXCXXGGXAACXGATXACCXGCCAATTGCXACCGAGAAGGXXGGAGGTG 
GGGAAGGCXTXGXAXAAXCCCACCCACCXCACCAAAACGAXGAAGGXAXGCXGXCAXGGXCCXXXCXGGAAGXXX 
CTGGXGCCAXTXCXGAACXGXTACAACXXGTAXTXCCAAACCXGGXXCAXAXXXAXACXXXGCAAXCCAAAXAAA 
GAXAACCCXXAXXCCAXAAAAAAAAAAAAAAAAAA 
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FIGURE 40 

MKLLPSWLKLFLAAVLSALVTGESLERLRRGLAAGTSNPDPPTVSTDQLLPLGGGRDRKVRDLQEADLDLLRVT 
LSSKPQALATPNKEEHGKRKKKGKGLGKKRDPCLRKYKDFCIHGECKYVKELRAPSCICHPGYHGERCHGLSLPV 
ENRLYTYDHTTILAWAWLSSVCLLVIVGLLMFRYHRRGGYDVENEEKVKLGMTNSH 
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FIGURE 41 

ACGCGTCCGCTTCGGAATGAGAGACXCAACCATAATAGAAAGAATGGAGAACTATTAACCACCATTCTTCAGTGG 
GCTGTGATTTTCAGAGGGGAATACTAAGAAATGGTTTTCCATACTGGAACCCAAAGGTAAAGACACTCAAGGACA 
GACATTTTTGGCAGAGCATAG ATGA AAATGGCAAGTTCCCTGGCTTTCCTTCTGCTCAACTTTCATGTCTCCCTC 
TTCTTGGTCCAGCTGCTCACTCCTTGCTCAGCTCAGTTTTCTGTGCTTGGACCCTCTGGGCCCATCCTGGCCATG 
GTGGGTGAAGACGCTGATCTGCCCTGTCACCTGTTCCCGACCATGAGTGCAGAGACCATGGAGCTGAGGTGGGTG 
AGTTCCAGCCTAAGGCAGGTGGTGAACGTGTATGCAGATGGAAAGGAAGTGGAAGACAGGCAGAGTGCACCATAT 
CGAGGGAGAACTTCGATTCTGCGGGATGGCATCACTGCAGGGAAGGCTGCTCTCCGAATACACAACGTCACAGCC 
TCTGACAGTGGAAAGTACTTGTGTTATTTCCAAGATGGTGACTTCTACGAAAAAGCCCTGGTGGAGCTGAAGGTT 
GCAGCATTGGGTTCTGATCTTCACATTGAAGTGAAGGGTTATGAGGATGGAGGGATCCATCTGGAGTGCAGGTCC 
ACTGGCTGGTACCCCCAACCCCAAATAAAGTGGAGCGACACCAAGGGAGAGAACATCCCGGCTGTGGAAGCACCT 
GTGGTTGCAGATGGAGTGGGCCTGTATGCAGTAGCAGCATCTGTGATCATGAGAGGCAGCTCTGGTGGGGGTGTA 
TCCTGCATCATCAGAAATTCCCTCCTCGGCCTGGAAAAGACAGCCAGCATATCCATCGCAGACCCCTTCTTCAGG 
AGCGCCCAGCCCTGGATCGCGGCCCTGGCAGGGACCCTGCCTATCXCGTTGCTGCTTCTCGCAGGAGCCAGTTAC 
TTCTTGTGGAGACAACAGAAGGAAAAAATTGCTCTGTCCAGGGAGACAGAAAGAGAGCGAGAGATGAAAGAAATG 
GGATACGCTGCAACAGAGCAAGAAATAAGCCTAAGAGAGAAGCTCCAGGAGGAACTCAAGTGGAGGAAAATCCAG 
TACATGGCTCGTGGAGAGAAGTCTTTGGCCTATCATGAATGGAAAATGGCCCTCTTCAAACCTGCGGATGTGATT 
CTGGATCCAGACACGGCAAACGCCATCCTCCTTGTTTCTGAGGACCAGAGGAGTGTGCAGCGTGCTGAAGAGCCG 
CGGGATCTGCCAGACAACCCTGAGAGAXTTGAATGGCGTTACTGTGTCCTTGGCTGTGAAAACTTCACATCAGGG 
AGACATTACTGGGAGGTGGAAGTGGGGGACAGAAAAGAGTGGCATATTGGGGTATGTAGTAAGAACGTGGAGAGG 
AAAAAAGGTTGGGTCAAAATGACACCGGAGAACGGATACTGGACTATGGGCCTGACTGATGGGAATAAGTATCGG 
GCTCTCACTGAGCCCAGAACCAACCTGAAACTTCCTGAGCCTCCTAGGAAAGTGGGGATCTTCCTGGACTATGAG 
ACTGGAGAGATCTCGTTCTATAATGCCACAGATGGATCTCATATCTACACCTTTCCGCACGCCTCTTTCTCTGAG 
CCTCTATATCCTGTTTTCAGAATTTTGACCTTGGAGCCCACTGCCCTGACCATTXGCCCAATACCAAAAGAAGTA 
GAGAGTTCCCCCGAXCCXGACCXAGTGCCTGATCAXTCCCXGGAGACACCACTGACCCCGGGCXXAGCXAATGAA 
AGTGGGGAGCCXCAGGCTGAAGTAACATCTCTGCXXCXCCCXGCCCACCCTGGAGCTGAGGTCXCCCCTTCTGCA 
ACAACCAATCAGAACCATAAGCXACAGGCACGCACXGAAGCACTXXACTGAXAXXCAXXCCATTATTCCAXAXGA 
CAGTXGTTXXGAGXXXCGXACCACCTTATTGXCCCCXXAXACAGATAAGGAAACTGGGGXGCAGAAAGGTGAAXT 
AACTXTACAAAGXAGACAXGACAAGTGAACAGCAGAGCXGGGAXCTAAACAGCAATAACTAACAXXAACAGAGAA 
XXTAAAAXGTXCTTAGXGCXGXGXXAXAAGCTXXGGXGGATGXCACXCCXXXAATCCTCACAACACCCTGXCGGG 
TAGXCAXATTTTGCAAGXAXGGAAGCXGAGGCAGGGCAACAXGAAGXAACTTACATAATXCAXACAGXAAXXTGT 
GCAGXTGGGAGAXGTTCAGCCTTAGTCCCTGGCTAAXXGCCTGTTCTTTTCCAGCCXGAXXXTTTTTCCCACAGG 
AAGAGCCCACAXGTAGCCCXGAGGXXTCCTTCCCAGGACAGCXGCAGGGXAGAGAXCATXTXAAGXGCXXGTGGA 
GXXGACATCCCTATXGACXCXXXCCCAGCTGAXATCAGAGACXXAGACCCAGCACTCCTXGGATXAGCXCTGCAG 
AGTGTCXTGGTTGAGAGAATAACCTCATAGTACCAACATGACATGTGACXTGGAAAGAGACXAGAGGCCACACXX 
GAXAAAXCAXGGGGCACAGAXAXGXXCCCACCCAACAAAXGXGATAAGXGATXGXGCAGCCAGAGCCAGCCXXCC 
XXCAATCAAGGXXTCCAGGCAGAGCAAATACCCXAGAGATXCXCXGXGAXAXAGGAAAXTXGGATCAAGGAAGCT 
AAAAGAAXXACAGGGATGTTTTTAATCCCACXAXGGACTCAGTCTCCTGGAAATAGGTCTGTCCACTCCTGGTCA 
TTGGXGGATGXXAAACCCAXAXXCCXXXCAACXGCTGCCXGCXAGGGAAAACXGCXCCXCAXXAXCAXCACXAXX 
AXXGCTCACCACTGTATCCCCXCTACTTGGCAAGXGGXXGTCAAGXXCXAGXTGXXCAATAAATGTGXTAAXAAT 
GAAAAAAAAAAA 
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FIGURE 42 

MKMASSLAFLLLNFHVSLFLVQLLTPCSAQFSVLGPSGPILAMVGEDADLPCHLFPTMSAETMELRWVSSSLRQV 
VNVYADGKEVEDRQSAPYRGRTSILRDGITAGKAALRIHNVTASDSGKYLCYFQDGDFYEKALVELKVAALGSDL 
HIEVKGYEDGGIHLECRSTGWYPQPQIKWSDTKGENIPAVEAPWADGVGLYAVAASVIMRGSSGGGVSCIIRNS 
LLGLEKXASISIADPFFRSAQPWIAALAGTLPISLLLLAGASYFLWRQQKEKIALSRETEREREMKEMGYAATEQ 
EISLREKLQEELKWRKIQYMARGEKSLAYHEWKMALFKPADVILDPDTANAILLVSEDQRSVQRAEEPRDLPDNP 
ERFEWRYCVLGCENFTSGRHYWEVEVGDRKEWHIGVCSKNVERKKGWVKMTPENGYWTMGLTDGNKYRALTEPRT 
NLKLPEPPRKVGIFLDYETGEISFYNATDGSHIYTFPHASFSEPLYPVFRILTLEPTALTICPIPKEVESSPDPD 
LVPDHSLETPLTPGLANESGEPQAEVTSLLLPAHPGAEVSPSATTNQNHKLQARTEALY 
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FIGURE 43 

GGCACGAGGGTAGTGAGCGGTGTTTCAGGATGTGAGGGCCCGCAGGAGCCGAGTCAGGCTCTCTCCACTGCCTGC 
CCGCCACCGTGCAAGCTCTGGCCGGCGCTGCCCACAGTCCCCATGGTGGGCAGCCCCCGCGGCGGGGACCCCTGA 
TCGGCAGCGGC ATG CCAGGGAAGCCCAAGCACCTGGGCGTCCCCAACGGGCGCATGGTTCTGGCTGTGTCAGATG 
GAGAGCTGAGCAGCACGACGGGGCCCCAGGGCCAGGGCGAGGGCCGCGGCAGCTCTCTCAGCATCCACAGCCTCC 
CCAGTGGTCCCAGCAGCCCCTTCCCAACCGAGGAGCAGCCTGTGGCCAGCTGGGCCCTGTCCTTCGAGCGGCTGT 
TGCAGGACCCGCTGGGCCTGGCTTACTTCACTGAGTTCCTGAAGAAGGAGTTCAGCGCGGAAAACGTGACTTTCT 
GGAAGGCCTGCGAGCGCTTCCAGCAGATCCCGGCCAGCGATACCCAGCAGCTAGCTCAGGAGGCCCGCAACATCT 
ACCAGGAGTTCCTGTCCAGCCAGGCGCTGAGCCCAGTGAACATCGACCGTCAGGCCTGGCTTGGCGAGGAGGTGC 
TGGCCGAGCCCCGGCCGGACATGTTTCGGGCACAGCAGCTTCAGATCTTCAACTTGATGAAGTTCGACAGCTATG 
CGCGCTTCGTCAAGTCCCCGCTGTACCGCGAGTGCCTGCTAGCCGAAGCCGAGGGACGCCCTCTGCGGGAACCTG 
GCTCCTCGCGCCTCGGCAGCCCTGACGCCACGAGGAAGAAGCCGAAGCTGAAGCCCGGGAAGTCGCTGCCGCTGG 
GTGTGGAGGAGTTGGGGCAGCTGCCACCCGTTGAGGGTCCTGGGGGCCGCCCTCTCCGCAAGTCCTTCCGCCGGG 
AGCTGGGCGGGACTGCAAACGCCGCCTTGCGCCGAGAGTCTCAGGGCTCCCTCAACTCCTCCGCCAGCCTGGACC 
TTGGCTTCCTAGCCTTCGTCAGCAGCAAATCTGAGAGCCACCGGAAGAGCCTTGGGAGCACGGAGGGTGAAAGTG 
AAAGCCGGCCAGGGAAGTACTGCTGTGTGTACCTGCCCGATGGCACAGCCTCCTTGGCCCTGGCCAGACCTGGCC 
TCACCATCCGAGACATGCTGGCAGGGATCTGTGAGAAACGAGGCCTCTCTCTACCTGACATCAAGGTCTACCTGG 
TGGGCAATGAACAGGCCCTGGTCCTGGATCAGGACTGCACCGTGCTGGCGGATCAGGAAGTGCGGCTGGAAAACA 
GGATCACCTTCGAGCTGGAGCTGACGGCGCTGGAGCGCGTGGTACGAATCTCAGCCAAGCCCACCAAGCGGCTGC 
AGGAGGCGCTGCAGCCCATTCTGGAGAAGCACGGCTTGAGCCCGCTAGAGGTGGTGCTGCACCGGCCAGGCGAGA 
AACAGCCTCTGGATCTGGGGAAGCTAGTGAGCTCGGTGGCGGCCCAGAGACTGGTTTTGGACACTCTTCCAGGTG 
TGAAGATCTCCAAAGCCCGTGACAAATCTCCCTGCCGCAGCCAGGGCTGCCCACCTAGAACTCAGGATAAGGCCA 
CCCATCCCCCTCCAGCGTCCCCCAGTTCTCTGGTGAAGGTGCCCAGTAGTGCCACTGGAAAGCGGCAGACCTGTG 
ACATCGAAGGCCTGGTGGAGCTGCTGAACCGGGTGCAGAGCAGCGGGGCCCACGACCAGAGGGGCCTTCTGAGGA 
AAGAGGACCTGGTACXTCCAGAATTTCTGCAGCTGCCCGCCCAAGGGCCCAGCTCCGAGGAGACCCCACCACAGA 
CCAAATCAGCAGCCCAGCCCATCGGGGGATCCTTGAACTCCACCACCGACTCAGCCCTCTGACAGCTACCCAACA 
GTCCAGGACAGCTGCATGGCACCCGGCGGGCCGAGCATGCCATGGGTCCGCTCTGCATGCCCTGTCTGTGCCATG 
AGTGTCCCTGGCCCCTTCCTGCCATGGGCAGGCCCGCAGGAAGAGCCGGTAGGGGTGGAAAGGGGACTCAGATGA 
GACACACCCCACAGCTGCCACCGCCTTGTCCCTCAACAAGCTCACCCCCAATCCCTTGCAGCCAGGCCACAATGG 
GGGAGGTGAGTCCAGCCCCTTGGAACAGGCTTGCCCAACATGGAGGGATGGCGTTGGCAGTGCCAGCCTCCCCAG 
CCTGTGCCAAGCTTCAACAGGGGCAAGAGGAGGGGCCGGCCCCTCCTCAGGAAGCTGGTATGAGTAAGGCCTTGA 
GGGTGCAGGCAGGCAGCCCTGTACCCCACCCACATAGACTATACTGTACATACAGATTTTGCAGTAGGCTTGGGG 
CAGCTGGGTTTGTCCTTGATGTATGATACTGXTATTATAATAATTATTATTATTCTGCAAAAAAAAAAAAAAAAA 
AA 
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FIGURE 44 

MPGKPKHLGVPNGRMVLAVSDGELSSTTGPQGQGEGRGSSLSIHSLPSGPSSPFPTEEQPVASWALSFERLLQDP 
LGLAYFTEFLKKEFSAENVTFWKACERFQQIPASDTQQLAQEARNIYQEFLSSQALSPVNIDRQAWLGEEVLAEP 
RPDMFRAQQLQIFNLMKFDSYARFVKSPLYRECLLAEAEGRPLREPGSSRLGSPDATRKKPKLKPGKSLPLGVEE 
LGQLPPVEGPGGRPLRKSFRRELGGTANAALRRESQGSLNSSASLDLGFLAFVSSKSESHRKSLGSTEGESESRP 
GKYCCVYLPDGTASLALARPGLTIRDMLAGICEKRGLSLPDIKVYLVGNEQALVLDQDCTVLADQEVRLENRITF 
ELELTALERWRISAKPTKRLQEALQPILEKHGLSPLEWLHRPGEKQPLDLGKLVSSVAAQRLVLDTLPGVKIS 
KARDKSPCRSQGCPPRTQDKATHPPPASPSSLVKVPSSATGKRQTCDIEGLVELLNRVQSSGAHDQRGLLRKEDL 
VLPEFLQLPAQGPSSEETPPQTKSAAQPIGGSLNSTTDSAL 
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FIGURE 45 

cccctgcccgcccgacagcgccgccgcctgcccc6cc atgg gtcgaca6aag6agct6gtgtcccgctgcgggga 
gatgctccacatccgctaccggctgctccgacaggcgctggccgagtgcctggggaccctcatcctggtgatgtt 

tggctgtggctccgtggcccaggttgtgctcagccggggcacccacggtggtttcctcaccatcaacctggcctt 
tggctttgctgtcactctgggcatcctcatcgctggccaggtctctggggcccacctgaacccxgccgtgacctt 
tgccatgtgcttcctggctcgtgagccctggatcaagctgcccatctacaccctggcacXgacgctgggagcctt 
cttgggtgctggaatagtttttgggctgtattatgatgcaatctggcacttcgccgacaaccagctttttgtttc 
gggccccaatggcacagccggcatcxttgctacctacccctctggacacttggatatgatcaatggcttctttga 
ccagttcataggcacagcctcccttatcgtgtgtgtgctggccattgttgacccctacaacaaccccgxcccccg 
aggcctggaggccttcaccgtgggcctggtggtcctggtcattggcacctccatgggcttcaactccggctatgc 
cgtcaaccctgcccgggactttggcccccgcctxtttacagcccttgcgggctggggctctgcagtcttcacgac 
cggccagcattggxggtgggtgcccatcgxgxccccacxcctgggctccattgcgggxgxcxxcgxgxaccagcx 
gaxgatcggcxgccacctggagcagcccccacccxccaacgaggaagagaaxgtgaagctggcccatgtgaagca 
caaggagcagaxc tgag xgggcaggggccaxcxccccacxccgctgcccxggccttgagcatccactgactgxcc 
aagggccactcccaagaagcccccttcacgatccacccxxxcaggcxaaggagcxcccxaxcxacccxcacccca 
cgagacagcccctxcaggatttccactggaccttgcccaaatagcaccttaggccactgcccctaagctggggtg 
gaaccggaaxxxgggxcaaxacaxccxxxxgtcxcccaagggaagagaatgggcagcaggxatgxgtgtgxgxgc 
axgxgtgtgcaxgxgxgtgcaxgtgxgxgcaggggxgxgxgxgxgxggggggggxxcccagaxattcagggcaag 
ggaccagtcggaagggatxcxggctattgggggagcccagagacaggggaaggcagcctgtccaxctgxgcaxaa 
ggagaggaaagxxccagggxgxgxaxgxxxcaggggctxcacatggaggagcxgcagatagaxaxgxgxxxcxgx 
gxaxgxgxaxgxcxgccxxxxxxxcxaagtgggggcttctacaggcttttgggaagtagggxggaxgxgggxagg 
gcxgggaggagggggccacagcxtaggttxggagctctggatgtacaxacaxaagxaggagcagtgggacgtgtx 
tctgxcataaxgcaggcaxgaagggxggagtgaagtcaggtcaxaagtxtcatgxxxgcxxxtgtxxtgxxxtgt 
txtxaatgxatgtagcagatgxtacagxcxtagggatccgggaxgggagaccccactttagaaagggxcgxcact 
cctttaaxcctctactcaacaatgxacxcxxxxacxxxxaxaxxaaaaaaaataaaaxaaaxatgxgccxaaaaa 
aaaaaaaaaa 
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FIGURE 46 

MGRQKELVSRCGEMLHIRYRLLRQALAECLGTLIIiVMFGCGSVAQWLSRGTHGGFLTINLAFGFAVTLGILIAG 
QVSGAHLNPAVTFAMCFLAREPWIKLPIYTLAQTLGAFLGAGIVFGLYYDAIWHFADNQLFVSGPNGTAGIFATY 

PSGHLDMINGFFDQFIGTASLIVCVLAIVDPYNNPVPRGLEAFTVGLWLVIGTSMGFNSGYAVNPARDFGPRLF 
TALAGWGSAVFTTGQHWWWVPIVSPLLGSIAGVFVYQLMIGCHLEQPPPSNEEENVKLAHVKHKEQI 
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FIGURE 47 

TTTAGGTAAAACCGGATTTAACCCTTGGGAAAAAAAAAAAAAAAGGAGAGAGAAGACAGTTCCTTTCCTGTAffi^ 
ATTAAAACAAAATACAAATTGAGGAAGCTCTGCTACCCAGGCTGTCATGGTAGAGAACTXGAAG2\AGACCTGTTT 
GGATGGACACCTGGTTTCAAAAGTCAGGTGTGGAGACTGTTAAATGGGAGGGCCTCATCCATAAATGATTTCTGG 
CAACGTCXTCTTCAGGTGGAGCTTGACGTCTXTTTAATGTTACTTGGGGAGGGAGTGCTCATTAAGGGAXGCCAG 
GGCCAGCXCTGGXGGXXCCXGGGGAGGCTGCGTCCXXCCCTGCTXCTGCAXGXCAXGAGGCAGCAGGAAGGTTTC 
CCCXGCACCXGXCXGTCCTGGCTCCCTCXGGGTAGCCCCCXACXGXXCXGXGCTTCAGCACAGCCXGGXTXGXCA 
AGAGGCACAXAGXTGGGGCXGGGCXGCAXGGCACAGGGGCXXAXGXGCCXGCXGGXXATXTAATTXXCAGCCXXA 
AGTTXXCXXXAAXATTXTCCXGTXGGCXAXXXAAAGGXTTTGGXTATCXXXXATXCCXXAXCTACAAXCAAGAXG 
ACAAXGTAAXXGAAXXATCXTATTXATAACACGGXTCGTGATXCATGATTCAXGAXXACAAGTAGAAAATAXGXC 
AXGXXCCXCACCTCCAAAXAAAXAXGXGXGTGTCXGXNXGNGXGXCXAXAXGXAXGTGCGGAGAGGGAGAGAGTG 
GGGAAGGAGAGCAGXGXXAXCAXACAXAGAGAGGCXAAATGXGTCCCAXCCCXCACXGXCAGCTXXAXAAAGGAG 
XXTGACTCCATCCACAGAAGAAXGTTTTAXAAGACTAGGAAAACACGXTGAAAACTAGGAXAAACAGCAACAAAA 
ATCAACXAAAXAXGXTGTTACXGXTGCXAAGGATXXTCTCCTXAGAAXAATTTAGGAXTXXXAAAAATTXCXGXT 
XGCCAAATGCTGXAGAXAAATGGCCAGAXXCTXCCTAXCCCTAGGAXXCCTXXAXXATTTXXXTXCACAGAXXXX 
GAGAACAAGGGGGAGAGAXAGXAXGGAAGAXTAAGATTCCAXTAATCTTATAGAACTGTGTTGXCACCCAAATXC 
CTGCXTGXXXGAACAXGGCAXCXXCAXAGAXXCAGGAXXCACXACCCXCXAXAGCXGGAXCXTGAAAAXXAXCTG 
GCCAGATAATXXXGCATCTGCXXGGAXGAXXGXAGACXGAGATGTGAGXGGAGGAXAAAGTAXXAGACTTTTGCT 
GAGXAACXGCCAACCAAGAAGXATXXAXCGGACACTXACXAGGTGCCXAGGAXXGTATCAGAGGGAATATGAAAX 
GXGTCCCTGCCCXACCXAGXXTXAACGACAGAAXAXCXATXAAAGGCXACTTAGCXGAAGGGTAAGGGXGACAGG 
TCTAGGGGAAGCTTTGGGAGGXGGTGXGCTGXGACAGAAAAAGXGGCAGAGXAGGGACGAGAGACCXGCAXXCTA 
GCCCXGXTTCTGXCACXXGCTCTGXACACTTAGACAACAGCXXGACCXCTXGAGCTXXAGXTXCCTCCXCXGCAX 
AATGAGAGGGTTAGACTACXGAATTGTAXGGGAAAAAAATACAAATXCCTGGGTCCXAGGCCATGCCXGCXGAAT 
CCGACXGXTCAGGAAGAGGCCTAGGAAAXCXGXGAGGGAAXCCCCAGGGGAATCXCGTGACCAGCCAGGXGTGAA 
AXCXGCXAACXGGAAGATCXCAAAGCXTCCXXCACXXXTTGXGAXXTTGTGGTCAXGXAACGXXACXGTATXAXT 
CXACGXAAATGTGGGTACXXGGAXGTTTAXCATACXGXXXCTCTGXGTTTACATACTAAXXXGTGTAAGAAAXGC 
ATTTXAGTCXGTGXACCXCAACCTGCXGXXXGXTTCCXAGAGGXGXXAGTAGXCXTTAAATACAAGXAAGACXTA 
AGAGGATAXXXGATGXXAXTXACCXGGAXAXXTXCXXCCCCTTXXAXXTATTXAGAGGAAATTGAGAXXCXAGGA 
GCCAAAAAAAXGAAAACAAAAXXCXAAGGCAAAGXXAAAGAAAAAAAXTACATTATXXCXXACCATXTGCXACTX 
TAXAAXGAAAAXXXAAAAAXTATAXGGGAAGAXXTXTCTCXGGGATAACAAATCCXTGTCAXAAAGXAAGAGGTC 
TXXXXAAAGXAGGXAGGCXAXAAGGCCXGXAAXTXAAAAXAAXACXCCXXXCTCTAGGGTXXGGXGCAATXCTCC 
AXXAATGAAGATAACAXTTGAAXTCCCCAAAGCAGGXGAGGAGXCGGGGAGGAGAAAGCGATGTXAAAATGAAAA 
CTCACXGCAAAAGAGGAGGCAGAGGAAGAAGGAAXGXAAACCCCXXAAAGCAGAXGTGXGXGGGGCCTTAXGAAG 
ACCAGGATXCXGCGGGXGXCAGGGGAXXGCCCCTCTTGACAGAGACXAGGGTXXXAGACXGAGGCXTCCXGCAGG 
GTGTTCGCATXGCCCTTCXCCGTTCCCCTXCAGACCTXTCTGGGGAGAAGAGGXGGGAGGAGGGGAGAAAGACXG 
XXCAXCTXATTCXGAAXCCXGGAGCAGCTGAAGGXXXXCXCXXGAGXCAGGAXGCAGXGGXAAXGCATXAACCAG 
CAAGXGXGGCCAAGGAXAAXGAAAAAGXGGGAAAGGAAGGXCCXCCXCCXCCCTGAXTGXAGCAXCCAGCAGXCX 
CXGXAGCCAGGTXACXCAAGAACCACATTXGAXXXCCXGGCCCXXXGCCTTGGCAGTGATGGCATTTTTATTTCA 
CXGXGXXXXAAAGXCXTCATXXAXXXXXATAACAXGGGXXAGGGAGAAGGGCCACAAATGGAGGGAXXGXCCXXX 
CAAGCACCACAGCTTCAGATAAAATXAGTACXTTCAAATAXXGXCCACXXXAACXXAAAAAAXXCXAGAGGGATT 
AXAXXGGAGACXCAACTGCCCXXGGTTXXAGTTTAXAAAAXGGCCTAGXACTGTGGAAXXXXAAXXXXAGAAAGX 
CXXAGCAXCAGATCAXAAACAXTCATXAAAAGAACXCACATCCCAXCTGAAACXXCCCAGGGGAGXXGGGATTCX 
XAGXAGATXGGTAGAAAGGGGCXCAXXXTCTACXGCATXXCCCATTTXXGGXAXCXXGXXCAGCATGTTXXAXXX 
XTAXTXCXXGXCTGCAGAACAXCCXATAXXXATGAGAACATXCXXXAAGAAGACCACCACAXAGAAXACCCCXXC 
CXATCAGCXCGCXCTGAXTXAGCCXTAAXTTTGXXAAAXXXTXXAGAGAXGAATGAAGXGCXGCXGXGGAAAGAA 
AXGXACAXAXACXAXXXCXGXAXCAXXAAAAXXACAXTXXXAXGGXXCAAG 
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CAGCCCGAGCCCGAGCCCGAGCCCGAGCCGGCGCCACCGCGCCCCCGGCC ATGG CTTTTGCCAATTTCCGCCGCA 
TCCTGCGCCTGTCTACCTTCGAGAAGAGAAAGTCCCGCGAATATGAGCACGXCCGCCGCGACCTGGACCCCAACG 
AGGTGTGGGAGATCGTGGGCGAGCTGGGCGACGGCGCCTTCGGCAAGGTTTACAAGGCCAAGAATAAGGAGACGG 
GTGCTTTGGCTGCGGCCAAAGTCAXTGAAACCAAGAGTGAGGAGGAGCTGGAGGACTACATCGTGGAGATTGAGA 
TCCTGGCCACCTGCGACCACCCCTACATTGTGAAGCTCCTGGGAGCCTACTATCACGACGGGAAGCTGTGGATCA 
TGATTGAGTTCTGTCCAGGGGGAGCCGTGGACGCCATCATGCTGGAGCTGGACAGAGGCCTCACGGAGCCCCAGA 
TACAGGTGGTTTGCCGCCAGATGCTAGAAGCCCTCAACTTCCTGCACAGCAAGAGGATCATCCACCGAGATCTGA 
AAGCTGGCAACGTGCTGATGACCCTCGAGGGAGACATCAGGCTGGCTGACTTTGGTGTGTCTGCCAAGAATCTGA 
AGACTCTACAGAAACGAGATTCCTTCATCGGCACGCCTTACTGGATGGCCCCCGAGGTGGTCATGTGTGAGACCA 
TGAAAGACACGCCCTACGACTACAAAGCCGACATCTGGXCCCTGGGCATCACGCTGATTGAGATGGCCCAGATCG 
AGCCGCCACACCACGAGCTCAACCCCATGCGGGTCCTGCTAAAGATCGCCAAGTCAGACCCTCCCACGCTGCTCA 
CGCCCTCCAAGTGGTCTGTAGAGTTCCGTGACTTCCTGAAGATAGCCCTGGATAAGAACCCAGAAACCCGACCCA 
GTGCCGCGCAGCTGCTGGAGCATCCCTTCGTCAGCAGCATCACCAGTAACAAGGCTCTGCGGGAGCTGGTGGCTG 
AGGCCAAGGCCGAGGTGATGGAAGAGATCGAAGACGGCCGGGATGAGGGGGAAGAGGAGGACGCCGTGGATGCCG 
CCTCCACCCTGGAGAACCATACTCAGAACTCCTCTGAGGTGAGTCCGCCAAGCCTCAATGCTGACAAGCCTCTCG 
AGGAGTCACCTTCCACCCCGCXGGCACCCAGCCAGTCTCAGGACAGTGXGAATGAGCCCTGCAGCCAGCCCTCTG 
GGGACAGAXCCCXCCAAACCACCAGXCCCCCAGXCGXGGCCCCXGGAAATGAGAACGGCCXGGCAGXGCCXGTGC 
CCCXGCGGAAGXCCCGACCCGTGXCAAXGGAXGCCAGAAXXCAGGTAGCCCAGGAGAAGCAAGXTGCXGAGCAGG 
GXGGGGACCTCAGCCCAGCAGCCAACAGAXCTCAAAAGGCCAGCCAGAGCCGGCCCAACAGCAGCGCCCXGGAGA 
CCXXGGGTGGGGAGAAGCXGGCCAAXGGCAGCCXGGAGCCACCTGCCCAGGCAGCTCCAGGGCCXTCCAAGAGGG 
ACTCGGACXGCAGCAGCCTCTGCACCTCTGAGAGCAXGGACTATGGXACCAATCTCXCCACXGACCTGTCGCXGA 
ACAAAGAGATGGGCXCTCXGXCCATCAAGGACCCGAAACXGTACAAAAAAACCCTCAAGCGGACACGCAAAXTTG 
XGGXGGAXGGTGTGGAGGXGAGCATCACCACCXCCAAGAXCAXCAGCGAAGATGAGAAGAAGGAXGAGGAGAXGA 
GAXXXCXCAGGCGCCAGGAACXCCGAGAGCXXCGGCTGCTCCAGAAAGAAGAGCAXCGGAACCAGACCCAGCXGA 
GTAACAAGCATGAGCXGCAGCXGGAGCAAATGCATAAACGXTTTGAACAGGAAAXCAACGCCAAGAAGAAGXXCX 
XXGACACGGAAXXAGAGAACCXGGAGCGXCAGCAAAAGCAGCAAGTGGAGAAGAXGGAGCAAGACCATGCCGXGC 
GCCGCCGGGAGGAGGCCAGGCGGAXCCGCCXGGAGCAGGAXCGGGACXACACCAGGTXCCAAGAGCAGCXCAAAC 
XGAXGAAGAAAGAGGTGAAGAACGAGGXGGAGAAGCTCCCCCGACAGCAGCGGAAGGAAAGCATGAAGCAGAAGA 
XGGAGGAGCACACGCAGAAAAAGCAGCXXCXXGACCGGGACXXXGTAGCCAAGCAGAAGGAGGACCXGGAGCXGG 
CCAXGAAGAGGCTCACCACCGACAACAGGCGGGAGAXCTGTGACAAGGAGCGCGAGTGCCXCATGAAGAAGCAGG 
AGCXCCTXCGAGACCGGGAAGCAGCCCXGXGGGAGAXGGAAGAGCACCAGCTGCAGGAGAGGCACCAGCTGGTGA 
AGCAGCAGCXCAAAGACCAGXACXXCCXCCAGCGGCACGAGCXGCXGCGCAAGCATGAGAAGGAGCGGGAGCAGA 
XGCAGCGCXACAACCAGCGCAXGATAGAGCAGCXGAAGGTGCGGCAGCAACAGGAAAAGGCGCGGCTGCCCAAGA 
XCCAGAGGAGXGAGGGCAAGACGCGCAXGGCCATGTACAAGAAGAGCCXCCACATCAACGGCGGGGGCAGCGCAG 
CXGAGCAGCGXGAGAAGAXCAAGCAGTXCXCCCAGCAGGAGGAGAAGAGGCAGAAGXCGGAGCGGCXGCAGCAAC 
AGCAGAAACACGAGAACCAGAXGCGGGACAXGCXGGCGCAGTGTGAGAGCAACAXGAGCGAGCTGCAGCAGCXGC 
AGAATGAAAAGXGCCACCXCCXGGXAGAGCACGAAACCCAGAAACTGAAGGCCCTGGATGAGAGCCAXAACCAGA 
ACCTGAAGGAAXGGCGGGACAAGCXTCGGCCGCGCAAGAAGGCXCXGGAAGAGGAXCXGAACCAGAAGAAGCGGG 
AGCAGGAGATGTXCTXCAAGCXGAGCGAGGAGGCGGAGXGCCCAAACCCCXCCACCCCAAGCAAGGCCGCCAAGT 
XCTTCCCCTACAGTTCTGCGGAXGCXTCXJDA^AACCGCCCGGGGCXGTGGCXGGCAGCXXGGXGGGCCCCAGGG 
CCXXCXCCXXCAXTCTCXGXGAACATGXAACTCAGGACCCCXXTXCCCXCXTGCGTCXGTGCCAGCTCAAATCCA 
GCCCCXCGCCCXGXGCCACCCCAACXGXGCCXGAXAGACCXGCCCCAGCGXXCCXGACXXCXXGCXGGCCXGXGG 
AGGGTGAGGTGXAATTAXXXGXCACCTGAACCXAAXGXAXAXTCTCCXXGAGCCCCAGAXCCCXXCAAGCTGGAA 
GGGAXGGGGCXGXXGGTGGGGXCAGGGTCCAAGAGGAAXGGGXGXXCXGTGGCCXCGAGXCCTCXCCXGXXXGCG 
AAAAXACCAGXXXXGCXCXCXGXGGGACAAAGCACXGCXGATGAAGXCCCCGTGGGCXCAXCCGGGCXGGAAXXC 
XXGGTXXXXCAGCCATXCCCXGCAGAGXCACXCAATCAXCAAGXCCCXCACAGCCAXXXCTGTXCCCAGAGGAGC 
CAGGCCXGCAGCXGGCTGCTCAGGAGAXGGCCXCAXXCCTCCXGXXCXCCCAGXXXGCXXXCCACXTAAGACAAA 
GCCXXGXCXAXGXGGGGGGCGGGGACCGGGGAAAGAGGGAGGCXGAAAXGXXXAXXCXGCXXCXCCCGXGXXXCA 
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TGCCATCTCGCGTCCCCCTTCCTGCACATGGGTGTGAATGCACACACAXACGCGTACACACAGGACTTGGTCTGC 
TGGCCTGGCCTCTTCTGCCCAGGTGGGTTGGAACACGTTTGCTGCCTGAGCCTGTGCCACTGAGCATGTTAGGTG 
GAGCAGTTGGTGTGGCACGTGCGGGGTGTTGGCACCGGAGGCATGGAAAAGCACAGGCTGTACTGCCAGGCTGCG 
ATGCGTGCTGGCCCCCGCACAGGCTCCTGTGTGCAGGGACXGATTCCXCAGCACACGAGGCTTCCACAACCCAGT 
CTGCTCCATAGCACTCTGGCCCACCCTGTCTGCAGGTGAAACAGGAGGGCTGTTTGCCTCTGCCCCATCCCCCCG 
ACTGTGTTCAGGAGTCCCACCTTGCATTTCAGACCTGGGCTGGCAGTCTGTTGGACTTCTCTTCAGGAAGAAAAA 
GCATCAGGGGGAAATGGAATGCCCCTGCCCCAGGAACATGGCAGAAGCACAGGTTCTGTACCTCAGATGGACTCC 
TGCTGGGCCTTCGGGGTCTCAGTTGGCTTCCCCCAGATTCTGATTCTACAGCTGCAGAATGTATATAACACAATA 
AAAGCAAATGTTTGAACCAGT 
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MAFANFRRILRLSTFEKRKSREYEHVRRDLDPNEVWEIVGELGDGAFGKVYKAKNKETGALAAAKVIETKSEEEL 
EDYIVEIEILATCDHPYIVKLLGAYYHDGKLWIMIEFCPGGAVDAIMLELDRGLTEPQIQWCRQMLEALNFLHS 
KRIIHRDLKAGNVLMTLEGDIRLADFGVSAKNLKTLQKRDSFIGTPYWMAPEWMCETMKDTPYDYKADIWSLGI 
TLIEMAQIEPPHHELNPMRVLLKIAKSDPPTLLTPSKWSVEFRDFLKIALDKNPETRPSAAQLLEHPFVSSITSN 
KALRELVAEAKAEVMEEIEDGRDEGEEEDAVDAASTLENHTQNSSEVSPPSLNADKPLEESPSTPLAPSQSQDSV 
NEPCSQPSGDRSLQTTSPPWAPGNENGLAVPVPLRKSRPVSMDARIQVAQEKQVAEQGGDLSPAANRSQKASQS 
RPNSSALETLGGEKLANGSLEPPAQAAPGPSKRDSDCSSLCTSESMDYGTNLSTDLSLNKEMGSLSIKDPKLYKK 
TLKRTRKFWDGVEVSITTSKIISEDEKKDEEMRFLRRQELRELRLLQKEEHRNQTQLSNKHELQLEQMHKRFEQ 
EINAKKKFFDTELENLERQQKQQVEKMEQDHAVRRREEARRIRLEQDRDYXRFQEQLKLMKKEVKNEVEKLPRQQ 
RKESMKQKMEEHTQKKQLLDRDFVAKQKEDLELAMKRLTTDNRREICDKERECLMKKQELLRDREAALWEMEEHQ 
LQERHQLVKQQLKDQYFLQRHELLRKHEKEREQMQRYNQRMIEQLKVRQQQEKARLPKIQRSEGKTRMAMYKKSL 
HINGGGSAAEQREKIKQFSQQEEKRQKSERLQQQQKHENQMRDMLAQCESNMSELQQLQNEKCHLLVEHETQKLK 
ALDESHNQNLKEWRDKLRPRKKALEEDLNQKKREQEMFFKLSEEAECPNPSTPSKAAKFFPYSSADAS 
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CATGTTGCCCTGGGGGGAACCTGCCAGCCCCGTAGCACTGCCCACCCCACCCACTGTGGTCTGTTGTACCCCACT 
GCTGGGGTGGTGGTTCCAATGAGACAGGGCACACCAAACTCCATCTGGCTGTTACTGAGGCGGAGACACGGGTGA 
TGATTGGCTTTCTGGGGAGAGAGGAAGTCCTGTGATTGGCCAGATCTCTGGAGCTTGCCGACGCGGTGTGAGGAC 
GCTCCCACGGAGGCCGGAATTGGCTGTGAAAGGACTGAGGCAGCCATCTGGGGGTAGCGGGCACTCTXATCAGAG 
CGGCTGGAGCCGGACCATCGTCCCAGAGAGCTGGGGCAGGGGGCCGTGCCCAATCTCCAGGGCTCCTGGGGCCAC 
TGCTGACCTGGCTGGATGCATCGGGCAGTGGATCCTCCAGGGGCCCGCGCTGCACGGGAAGCCTTTGCCCTTGGG 
GGCCTGAGCTGTGCTGGGGCCTGGAGCTCCTGCCCGCCTCATCCCCCTCCTCGTAGCGCATGGCTGCCTGGAGGC 
AGATGCTCAGCCAGCATTGGGCAGCCCCCGCTTCCTGCTCCCCTACCCCCTTCACATGGCAGTAGTTCTGGGCAC 
CCCAGCAAACCATATTATGCTCCAGGGGCGCCCACTCCAAGACCCCTCCATGGGAAGCTGGAATCCCTGCATGGC 
TGTGTGCAGGCATTGCTCCGGGAGCCAGCCCAGCCAGGGCTTTGGGAACAGCTTGGGCAACTGTACGAGTCAGAG 
CACGATAGTGAGGAGGCCACACGCTGCTACCACAGCGCCCTTCGATACGGAGGAAGCTTCGCTGAGCTGGGGCCC 
CGCATTGGCCGACTGCAGCAGGCCCAGCTCTGGAACTTTCATACTGGCTCCTGCCAGCACCGAGCCAAGGTCCTG 
CCCCCACTGGAGCAAGTGTGGAACTTGCTACACCTTGAGCACAAACGGAACTATGGAGCCAAGCGGGGAGGTCCC 
CCGGTGAAGCGAGCTGCTGAACCCCCAGTGGTGCAGCCTGTGCCTCCTGCAGCACTCTCAGGCCCCTCAGGGGAG 
GAGGGCCTCAGCCCTGGAGGCAAGCGAAGGAGAGGCTGCAACTCTGAACAGACTGGCCTTCCCCCAGGGCTGCCA 
CTGCCTCCACCACCATTACCACCACCACCACCACCACCACCACCACCCCTGCCTGGCCTGGCTACCAGCCCCCCA 
TTTCAGCTAACCAAGCCAGGGCTGTGGAGTACCCTGCATGGAGATGCCTGGGGCCCAGAGCGCAAGGGTTCAGCA 
CCCCCAGAGCGCCAGGAGCAGCGGCACTCGCTGCCTCACCCATATCCATACCCAGCTCCAGCGTACACCGCGCAC 
CCCCCTGGCCACCGGCTGGTCCCGGCTGCTCCCCCAGGCCCAGGCCCCCGCCCCCCAGGAGCAGAGAGCCATGGC 
TGCCTGCCTGCCACCCGTCCCCCCGGAAGTGACCTTAGAGAGAGCAGAGTTCAGAGGTCGCGGATGGACTCCAGC 
GTTTCACCAGCAGCAACCACCGCCTGCGTGCCTTACGCCCCTTCCCGGCCCCCTGGCCTCCCCGGCACCACCACC 
AGCAGCAGCAGTAGCAGCAGCAGCAACACTGGTCTCCGGGGCGTGGAGCCGAACCCAGGCATTCCCGGCGCTGAC 
CATTACCAAACTCCCGCGCTGGAGGTCTCTCACCATGGCCGCCTGGGGCCCXCGGCACACAGCAGTCGGAAACCG 
TTCTTGGGGGCTCCCGCTGCCACTCCCCACCTATCCCTGCCACCTGGACCTTCCTCACCCCCTCCACCCCCCTGT 
CCCCGCCTCTTACGCCCCCCACCACCCCCTGCCTGGTTGAAGGGTCCGGCCTGCCGGGCAGCCCGAGAGGATGGA 
GAGATCTTAGAAGAGCTCTTCTTTGGGACTGAGGGACCCCCCCGCCCTGCCCCACCACCCCTCCCCCATCGCGAG 
GGCTTCTTGGGGCCTCCGGCCTCCCGCTTTTCTGTGGGCACTCAGGATTCTCACACCCCTCCCACTCCCCCAACC 
CCAACCACCAGCAGTAGCAACAGCAACAGTGGCAGCCACAGCAGCAGCCCTGCTGGGCCTGTGTCCTTTCCCCCA 
CCACCCTATCTGGCCAGAAGTATAGACCCCCTTCCCCGGCCTCCCAGCCCAGCACAGAACCCCCAGGACCCACCT 
CTTGTACCCCTGACTCTTGCCCTGCCTCCAGCCCCTCCTTCCTCCTGCCACCAAAATACCTCAGGAAGCTTCAGG 
CGCCCGGAGAGCCCCCGGCCCAGGGTCTCCTTCCCAAAGACCCCCGAGGTGGGGCCGGGGCCACCCCCAGGCCCC 
CTGAGTAAAGCCCCCCAGCCTGTGCCGCCCGGGGTTGGGGAGCTGCCTGCCCGAGGCCCTCGACTCTTTGATTTT 
CCCCCCACTCCGCTGGAGGACCAGTTTGAGGAGCCAGCCGAATTCAAGATCCTACCTGATGGGCTGGCCAACATC 
ATGAAGATGCTGGACGAATCCATTCGCAAGGAAGAGGAACAGCAACAACACGAAGCAGGCGTGGCCCCCCAACCC 
CCGCTGAAGGAGCCCTTTGCAXCTCTGCAGTCTCCTXXCCCCACCGACACAGCCCCCACCACTACTGCTCCXGCT 
GXCGCCGXCACCACCACCACCACCACCACCACCACCACCACGGCCACCCAGGAAGAGGAGAAGAAGCCACCACCA 
GCCCXACCACCACCACCGCCXCTAGCCAAGXXCCCTCCACCCTCTCAGCCACAGCCACCACCACCCCCACCCCCC 
AGCCCGGCCAGCCXGCXCAAATCCXXGGCCXCCGTGCXGGAGGGACAAAAGXACXGXXATCGGGGGACXGGAGCA 
GCXGXTTCCACCCGGCCXGGGCCCTTGCCCACCACXCAGXAXTCCCCXGGCCCCCCATCAGGXGCXACCGCCCTG 
CCGCCCACCTCAGCGGCCCCXAGCGCCCAGGGCTCCCCACAGCCCXCXGCTTCCXCGXCATCXCAGXXCXCXACC 
TCAGGCGGGCCCTGGGCCCGGGAGCGCAGGGCGGGCGAAGAGCCAGTCCCGGGCCCCAXGACCCCCACCCAACCG 
CCCCCACCCCXATCTCXGCCCCCXGCXCGCXCTGAGXCXGAGGXGCXAGAAGAGATCAGCCGGGCTXGCGAGACC 
CTXGXGGAGCGGGXGGGCCGGAGXGCCACXGACCCAGCCGACCCAGXGGACACAGCAGAGCCAGCGGACAGXGGG 
ACTGAGCGACTGCXGCCCCCCGCACAGGCCAAGGAGGAGGCXGGCGGGGTGGCGGCAGTGXCAGGCAGCTGXAAG 
CGGCGACAGAAGGAGCATCAGAAGGAGCAXCGGCGGCACAGGCGGGCCXGXAAGGACAGXGXGGGXCGXCGGCCC 
CGXGAGGGCAGGGCAAAGGCCAAGGCCAAGGXCCCCAAAGAAAAGAGCCGCCGGGXGCXGGGGAACCXGGACCXG 
CAGAGCGAGGAGAXCCAGGGXCGTGAGAAGTCCCGGCCCGATCXXGGCGGGGCCTCCAAGGCCAAGCCACCCACA 
GCXCCAGCCCCXCCAXCAGCXCCXGCACCXXCXGCCCAGCCCACACCCCCGXCAGCCXCXGXCCCXGGAAAGAAG 
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GCTCGGGAGGAAGCCCCAGGGCCACCGGGTGTCAGCCGGGCCGACATGCTGAAGCTGCGCTCACTTAGTGAGGGG 
CCCCCCAAGGAGCTGAAGATCCGGCTCATCAAGGTAGAGAGTGGTGACAAGGAGACCTTTATCGCCTCTGAGGTG 
GAAGAGCGGCGGCTGCGCATGGCAGACCTCACCATCAGCCACTGTGCTGCTGACGTCGTGCGCGCCAGCAGGAAT 
GCCAAGGTGAAAGGGAAGTTTCGAGAGXCCTACCTTTCCCCTGCCCAGTCTGTGAAACCGAAGATCAACACTGAG 
GAGAAGCTGCCCCGGGAAAAACTCAACCCCCCTACACCCAGCATCTATCTGGAGAGCAAACGGGATGCCTTCTCA 
CCTGTCCTGCTGCAGTTCTGTACAGACCCTCGAAATCCCATCACAGTGATCCGGGGCCTGGCGGGCTCCCTGCGG 
CTCAACTTGGGCCTCTTCTCCACCAAGACCCTGGTGGAAGCGAGTGGCGAACACACCGTGGAAGTTCGCACCCAG 
GTGCAGCAGCCCTCAGATGAGAACTGGGATCTGACAGGCACTCGGCAGATCTGGCCTTGTGAGAGCTCCCGTTCC 
CACACCACCATTGCCAAGTACGCACAGTACCAGGCCTCATCCTTCCAGGAGTCTCTGCAGGAGGAGAAGGAGAGT 
GAGGATGAGGAGTCAGAGGAGCCAGACAGCACCACTGGAACCCCTCCTAGCAGCGCACCAGACCCGAAGAACCAT 
CACATCATCAAGTTTGGCACCAACATCGACTTGTCTGATGCTAAGCGGTGGAAGCCCCAGCTGCAGGAGCTGCTA 
AAGCTGCCCGCCTTCATGCGGGTAACATCCACGGGCAACATGCTGAGCCACGTGGGCCACACCATCCTGGGCATG 
AACACGGTGCAGCTGTACATGAAGGTGCCCGGCAGCCGAACGCCAGGCCACCAGGAGAATAACAACTTCTGCXCC 
GTCAACATCAACATTGGCCCAGGCGACTGCGAGTGGTTCGCGGTGCACGAGCACTACTGGGAGACCATCAGCGCT 
TTCTGTGATCGGCACGGCGTGGACTACTTGACGGGTTCCTGGTGGCCAATCCTGGATGATCTCTATGCATCCAAT 
ATTCCTGTGTACCGCTTCGTGCAGCGACCCGGAGACCTCGTGTGGATTAATGCGGGGACTGTGCACTGGGTGCAG 
GCCACCGGCTGGTGCAACAACATTGCCTGGAACGTGGGGCCCCTCACCGCCTATCAGTACCAGCTGGCCCTGGAA 
CGATACGAGTGGAATGAGGTGAAGAACGTCAAATCCATCGTGCCCATGATTCACGTGTCATGGAACGTGGCTCGC 
ACGGTCAAAATCAGCGACCCCGACTTGTTCAAGATGATCAAGTTCTGCCTGCTGCAGXCCATGAAGCACTGCCAG 
GTGCAACGCGAGAGCCTGGTGCGGGCAGGGAAGAAAATCGCTTACCAGGGCCGTGTCAAGGACGAGCCAGCCTAC 
TACTGCAACGAGTGCGATGTGGAGGTGTTTAACATCCTGTTCGTGACAAGTGAGAATGGCAGCCGCAACACGTAC 
CTGGTACACTGCGAGGGCTGTGCCCGGCGCCGCAGCGCAGGCCTGCAGGGCGTGGTGGTGCTGGAGCAGTACCGC 
ACXGAGGAGCTGGCTCAGGCCTACGACGCCTTCACGCTGGTGAGGGCCCGGCGGGCGCGCGGGCAGCGGAGGAGG 
GCACXGGGGCAGGCTGCAGGGACGGGCTTCGGGAGCCCGGCCGCGCCXXXCCCTGAGCCCCCGCCGGCTTTCXCC 
CCCCAGGCCCCAGCCAGCACGXCGCGATGAGGCCGGACGCCCCGCCCGCCTGCCXGCCCGCGCAAGGCGCCGCGG 
GGCCACCAGCACATGCCTGGGCXGGACCXAGGTCCCGCCTGXGGCCGAGAAGGGGGTCGGGCCCAGCCCXXCCAC 
CCCATXGGCAGCTCCCCXCACXXAAXTTATTAAGAAAAACTTXTTTXXXXTTTTTAGCAAATAXGAGGAAAAAAG 
GAAAAAAAATGGGAGACGGGGGAGGGGGCTGGCAGCCCCTCGCCCACCAGCGCCXCCCCTCACCGACTTTGGCCX 
TXXTAGCAACAGACACAAGGACCAGGCXCCGGCGGCGGCGGGGGTCACATACGGGTTCCCXCACCCXGCCAGCCG 
CCCGCCCGCCCGGCGCAGAXGCACGCGGCTCGTGXAXGXACATAGACGXXACGGCAGCCGAGGXXXTTAATGAGA 
XTCTTTCTATGGGCTTXACCCCTCCCCCGGAACCXCCTXXXTXACTTCCAATGCXAGCTGTGACCCCXGXACATG 
XCXCTTXATTCACXXGGXTATGAXXXGXAXTTTTXGXXCXXXXCXXGXXXXXTTGXXTXXAAXXXAXAACAGTCC 
CACXCACCXCXAXXTATTCATTTTTGGGAAAACCCGACCTCCCACACCCCCAAGCCATCCXGCCCGCCCCXCCAG 
GGACCGCCCGTCGCCGGGCTCTCCCCGCGCCCCAGTGTGTGTCCGGGCCCGGCCCGACCGTCTCCACCCGXCCGC 
CCGCGGCXCCAGCCGGGTTCXCAXGGXGCTCAAACCCGCTCCCCXCCCCXACGTCCXGCACXXTCTCGGACCAGT 
CCCCCCACTCCCGACCCGACCCCAGCCCCACCXGAGGGXGAGCAACTCCTGXACTGTAGGGGAAGAAGXGGGAAC 
XGAAAXGGTATTXTGXAAAAAAAAATAAATAAAATAAAAAAATTAAAGGTTTTAAAGAAAGAACTAXGAGGAAAA 
GGAACCCCGTCCXTCCCAGCCCCGGCCAACXXTAAAAAACACAGACCTXCACCCCCACCCCCXXXXCXXXTXAAG 
XGTGAAACAACCCAGGGCCAGGGCCTCACTGGGGCAGGGACACCCCGGGGTGAGTTTCTCXGGGGCTTTATTTTC 
GXTXXGTXGGXXGTTTTXTCTCCACGCTGGGGCXGCGGAGGGGTGGGGGGTTTACAGXCCCGCACCCXCGCACXG 
CACTGXCXCXCTGCCCCAGGGGCAGAGGGGTCXTCCCAACCCTACCCCTATTTTCGGXGAXXXXXGTGXGAGAAT 
AX X AAX AX X AAAAAT AAAC GG AG 
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GHPSKPYYAPGAPTPRPLHGKLESLHGCVQALLREPAQPGLWEQLGQLYESEHDSEEATRCYHSALRYGGSFAEL 

GPRIGRLQQAQLWNFHTGSCQHRAKVLPPLEQVWNLLHLEHKRNYGAKRGGPPVKRAAEPPWQPVPPAALSGPS 

GEEGLSPGGKRRRGCNSEQTGLPPGLPLPPPPLPPPPPPPPPPPPPPPLPGLATSPPFQLTKPGLWSTLHGDAWG 

PERKGSAPPERQEQRHSLPHPYPYPAPAYTAHPPGHRLVPAAPPGPGPRPPGAESHGCLPATRPPGSDLRESRVQ 

RSRMDSSVSPAATTACVPYAPSRPPGLPGTTTSSSSSSSSNTGLRGVEPNPGIPGADHYQTPALEVSHHGRLGPS 

AHSSRKPFLGAPAATPHLSLPPGPSSPPPPPCPRLLRPPPPPAWLKGPACRAAREDGEILEELFFGTEGPPRPAP 

PPLPHREGFLGPPASRFSVGTQDSHTPPTPPTPTTSSSNSNSGSHSSSSAGPVSFPPPPYLARSIDPLPRPPSPA 

QNPQDPPLVPLTLALPPAPPSSCHQNTSGSFRRPESPRPRVSFPKTPEVGPGPPPGPLSKAPQPVPPGVGELPAR 

GPRLFDFPPTPLEDQFEEPAEFKILPDGLANIMKMLDESIRKEEEQQQHEAGVAPQPPLKEPFASLQSPFPTDTA 

PTTTAPAVAVTTTTTTTTTTTATQEEEKKPPPALPPPPPLAKFPPPSQPQPPPPPPPSPASLLKSLASVLEGQKY 

CYRGTGAAVSTRPGPLPTTQYSPGPPSGATALPPTSAAPSAQGSPQPSASSSSQFSTSGGPWARERRAGEEPVPG 

PMTPTQPPPPLSLPPARSESEVLEEISRACETLVERVGRSATDPADPVDTAEPADSGTERLLPPAQAKEEAGGVA 

AVSGSCKRRQKEHQKEHRRHRRACKDSVGRRPREGRAKAKAKVPKEKSRRVLGNLDLQSEEIQGREKSRPDLGGA 

SKAKPPTAPAPPSAPAPSAQPTPPSASVPGKKAREEAPGPPGVSRADMLKLRSLSEGPPKELKIRLIKVESGDKE 

TFIASEVEERRLRMADLTISHCAADWRASRNAKVKGKFRESYLSPAQSVKPKINTEEKLPREKLNPPTPSIYLE 

SKRDAFSPVLLQFCTDPRNPITVIRGLAGSLRLNLGLFSTKTLVEASGEHTVEVRTQVQQPSDENWDLXGTRQIW 

PCESSRSHTTIAKYAQYQASSFQESLQEEKESEDEESEEPDSTTGTPPSSAPDPKNHHIIKFGTNIDLSDAKRWK 

PQLQELLKLPAFMRVTSTGNMLSHVGHTILGMNTVQLYMKVPGSRTPGHQENNNFCSVNIWIGPGDCEWFAVHEH 

YWETISAFCDRHGVDYLTGSWWPILDDLYASNIPVYRFVQRPGDLVWINAGTVHWVQATGWCNNIAWNVGPLTAY 

QYQLALERYEWNEVKNVKSIVPMIHVSWNVARTVKISDPDLFKMIKFCLLQSMKHCQVQRESLVRAGKKIAYQGR 

VKDEPAYYCNECDVEVFNILFVTSENGSRNTYLVHCEGCARRRSAGLQGVWLEQYRTEELAQAYDAFTLVRARR 

ARGQRRRALGQAAGTGFGSPAAPFPEPPPAFSPQAPASTSR 
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ATG GCGCCGCCGCCGCCGCCCGTGCT6CCCGT6CTGCTGCTCCTG6CCGCCGCCGCCGCCCTGCCGGC6ATG6GG 
CTGCGAGCGGCCGCCTGGGAGCCGCGCGTACCCGGCGGGACCCGCGCCTTCGCCCTCCGGCCCGGCTGTACCTAC 

GCGGTGGGCGCCGCTTGCACGCCCCGGGCGCCGCGGGAGCTGCTGGACGTGGGCCGCGATGGGCGGCTGGCAGGA 
CGTCGGCGCGTCTCGGGCGCGGGGCGCCCGCTGCCGCTGCAAGTCCGCTTGGTGGCCCGCAGTGCCCCGACGGCG 
CTGAGCCGCCGCCTGCGGGCGCGCACGCACCTTCCCGGCTGCGGAGCCCGTGCCCGGCTCTGCGGAACCGGTGCC 
CGGCTCTGCGGGGCGCTCTGCTTCCCCGTCCCCGGCGGCTGCGCGGCCGCGCAGCATTCGGCGCTCGCAGCTCCG 
ACCACCTTACCCGCCTGCCGCTGCCCGCCGCGCCCCAGGCCCCGCTGTCCCGGCCGTCCCATCTGCCTGCCGCCG 
GGCGGCTCGGTCCGCCTGCGTCTGCTGTGCGCCCTGCGGCGCGCGGCTGGCGCCGTCCGGGTGGGACTGGCGCTG 
GAGGCCGCCACCGCGGGGACGCCCTCCGCGTCGCCATCCCCATCGCCGCCCCTGCCGCCGAACTTGCCCGAAGCC 
CGGGCGGGGCCGGCGCGACGGGCCCGGCGGGGCACGAGCGGCAGAGGGAGCCTGAAGTTTCCGATGCCCAACXAC 
CAGGTGGCGTTGTTTGAGAACGAACCGGCGGGCACCCTCATCCTCCAGCTGCACGCGCACTACACCATCGAGGGC 
GAGGAGGAGCGCGTGAGCTATTACATGGAGGGGCTGTTCGACGAGCGCTCCCGGGGCTACTTCCGAATCGACTCT 
GCCACGGGCGCCGTGAGCACGGACAGCGTACTGGACCGCGAGACCAAGGAGACGCACGTCCTCAGGGTGAAAGCC 
GTGGACTACAGTACGCCGCCGCGCTCGGCCACCACCTACATCACTGTCTTGGTCAAAGACACCAACGACCACAGC 
CCGGTCTTCGAGCAGTCGGAGTACCGCGAGCGCGTGCGGGAGAACCTGGAGGTGGGCTACGAGGTGCTGACCATC 
CGCGCCAGCGACCGCGACTCGCCCATCAACGCCAACTTGCGTTACCGCGXGTTGGGGGGCGCGTGGGACGTCTTC 
CAGCTCAACGAGAGCTCTGGCGTGGTGAGCACACGGGCGGTGCTGGACCGGGAGGAGGCGGCCGAGTACCAGCTC 
CTGGTGGAGGCCAACGACCAGGGGCGCAATCCGGGCCCGCTCAGTGCCACGGCCACCGTGTACATCGAGGTGGAG 
GACGAGAACGACAACTACCCCCAGTTCAGCGAGCAGAACTACGTGGTCCAGGTGCCCGAGGACGTGGGGCTCAAC 
ACGGCTGTGCTGCGAGTGCAGGCCACGGACCGGGACCAGGGCCAGAACGCGGCCATTCACTACAGCATCCTCAGC 
GGGAACGTGGCCGGCCAGTTCTACCTGCACXCGCTGAGCGGGATCCTGGATGTGATCAACCCCTTGGATTTCGAG 
GATGTCCAGAAATACTCGCTGAGCATTAAGGCCCAGGATGGGGGCCGGCCCCCGCTCATCAATTCTTCAGGGGTG 
GTGTCTGTGCAGGTGCTGGATGTCAACGACAACGAGCCTATCTTTGTGAGCAGCCCCTTCCAGGCCACGGTGCTG 
GAGAATGTGCCCCTGGGCTACCCCGTGGTGCACATTCAGGCGGTGGACGCGGACTCTGGAGAGAACGCCCGGCXG 
CACTATCGCCTGGTGGACACGGCCXCCACCTTTCTGGGGGGCGGCAGCGCTGGGCCTAAGAATCCTGCCCCCACC 
CCXGACXXCCCCXTCCAGATCCACAACAGCXCCGGXXGGAXCACAGTGTGTGCCGAGCXGGACCGCGAGGAGGTG 
GAGCACTACAGCXTCGGGGXGGAGGCGGTGGACCACGGCTCGCCCCCCATGAGCTCCXCCACCAGCGXGXCCAXC 
ACGGTGCTGGACGTGAATGACAACGACCCGGTGTTCACGCAGCCCACCTACGAGCTTCGTCTGAATGAGGATGCG 
GCCGXGGGGAGCAGCGTGCTGACCCTGCAGGCCCGCGACCGTGACGCCAACAGTGXGAXXACCTACCAGCTCACA 
GGCGGCAACACCCGGAACCGCXXXGCACXCAGCAGCCAGAGAGGGGGCGGCCXCAXCACCCTGGCGCXACCXCXG 
GACTACAAGCAGGAGCAGCAGTACGTGCTGGCGGTGACAGCATCCGACGGCACACGGTCGCACACXGCGCATGTC 
CTAATCAACGXCACTGATGCCAACACCCACAGGCCTGTCXTXCAGAGCTCCCATXACACAGXGAGXGTCAGTGAG 
GACAGGCCXGXGGGCACCXCCATTGCTACCCXCAGXGCCAACGATGAGGACACAGGAGAGAAXGCCCGCAXCACC 
TACGTGATTCAGGACCCCGXGCCGCAGXTCCGCATTGACCCCGACAGTGGCACCATGTACACCATGAXGGAGCTG 
GACTATGAGAACCAGGXCGCCXACACGCXGACCATCATGGCCCAGGACAACGGCATCCCGCAGAAATCAGACACC 
ACCACCCXAGAGATCCTCATCCTCGATGCCAAXGACAATGCACCCCAGTXCCTGTGGGAXXXCXACCAGGGTXCC 
ATCTTTGAGGATGCTCCACCCTCGACCAGCATCCTCCAGGTCTCTGCCACGGACCGGGACTCAGGTCCCAATGGG 
CGXCXGCTGXACACCTTCCAGGGTGGGGACGACGGCGATGGGGACTXCXACATCGAGCCCACGXCCGGXGTGATT 
CGCACCCAGCGCCGGCXGGACCGGGAGAAXGTGGCCGTGTACAACCXXTGGGCTCTGGCTGTGGATCGGGGCAGT 
CCCACTCCCCTXAGCGCCTCGGXAGAAAXCCAGGXGACCAXCTTGGACATXAAXGACAAXGCCCCCAXGTXTGAG 
AAGGACGAACXGGAGCXGXXXGXXGAGGAGAACAACCCAGTGGGGTCGGXGGTGGCAAAGATXCGTGCTAACGAC 
CCXGAXGAAGGCCCXAATGCCCAGATCAXGXAXCAGAXXGXGGAAGGGGACAXGCGGCAXXXCTTCCAGCXGGAt 
CXGCXCAACGGGGACCXGCGTGCCAXGGXGGAGCTGGACXTTGAGGXCCGGCGGGAGXAXGTGCTGGXGGXGCAG 
GCCACGTCGGCTCCGCTGGTGAGCCGAGCCACGGTGCACATCCTTCTCGTGGACCAGAATGACAACCCGCCXGTG 
CTGCCCGACTXCCAGAXCCXCXXCAACAACTATGXCACCAACAAGXCCAACAGTTTCCCCACCGGCGXGATCGGC 
TGCAXCCCGGCCCAXGACCCCGACGXGTCAGACAGCCXCAACXACACCXTCGTGCAGGGCAACGAGCXGCGCCTG 
XXGCTGCTGGACCCCGCCACGGGCGAACTGCAGCXCAGCCGCGACCTGGACAACAACCGGCCGCTGGAGGCGCTC 
AXGGAGGXGXCXGXGTCTGATGGCATCCACAGCGXCACGGCCXXCXGCACCCXGCGTGXCACCATCAXCACGGAC 
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GACATGCTGACCAACAGCATCACTGTCCGCCTGGAGAACATGTCCCAGGAGAAGTTCCTGTCCCCGCTGCTGGCC 
CTCTTCGTGGAGGGGGTGGCCGCCGTGCTGTCCACCACCAAGGACGACGTCTTCGTCTTCAACGTCCAGAACGAC 

ACCGACGTCAGCTCCAACATCCTGAACGTGACCTTCTCGGCGCTGCTGCCTGGCGGCGTCCGCGGCCAGTTCTTC 
CCGTCGGAGGACCTGCAGGAGCAGATCTACCTGAATCGGACGCTGCTGACCACCATCTCCACGCAGCGCGTGCTG 
CCCTTCGACGACAACATCTGCCTGCGCGAGCCCTGCGAGAACTACATGAAGTGCGTGTCCGTTCTGCGATTCGAC 
AGCTCCGCGCCCTTCCTCAGCTCCACCACCGTGCTCTTCCGGCCCATCCACCCCATCAACGGCCTGCGCTGCCGC 
TGCCCGCCCGGCTTCACCGGCGACTACTGCGAGACGGAGATCGACCTCTGCTACTCCGACCCGTGCGGCGCCAAC 
GGCCGCTGCCGCAGCCGCGAGGGCGGCTACACCTGCGAGTGCTTCGAGGACTTCACTGGAGAGCACTGTGAGGTG 
GATGCCCGCTCAGGCCGCTGTGCCAACGGGGTGTGCAAGAACGGGGGCACCTGCGTGAACCTGCTCATCGGCGGC 
TTCCACTGCGTGTGTCCTCCTGGCGAGTATGAGAGGCCCTACTGTGAGGTGACCACCAGGAGCTTCCCGCCCCAG 
TCCTTCGTCACCTTCCGGGGCCTGAGACAGCGCTTCCACTTCACCATCTCCCTCACGTTTGCCACTCAGGAAAGG 
AACGGCTTGCTTCTCTACAACGGCCGCTTCAATGAGAAGCACGACTTCATCGCCCTGGAGATCGTGGACGAGCAG 
GTGCAGCTCACCTTCTCTGCAGGCGAGACAACAACGACCGTGGCACCGAAGGTTCCCAGTGGTGTGAGTGACGGG 
CGGTGGCACTCTGTGCAGGTGCAGTACTACAACAAGCCCAATATTGGCCACCTGGGCCTGCCCCATGGGCCGTCC 
GGGGAAAAGATGGCCGTGGTGACAGTGGATGATTGTGACACAACCATGGCTGTGCGCTTTGGAAAGGACATCGGG 
AACTACAGCTGCGCTGCCCAGGGCACTCAGACCGGCTCCAAGAAGTCCCTGGATCTGACCGGCCCTCTACTCCTG 
GGGGGTGTCCCCAACCTGCCAGAAGACTTCCCAGTGCACAACCGGCAGTTCGTGGGCTGCATGCGGAACCTGTCA 
GTCGACGGCAAAAATGTGGACATGGCCGGATTCATCGCCAACAATGGCACCCGGGAAGGCTGCGCTGCTCGGAGG 
AACTTCTGCGATGGGAGGCGGTGTCAGAATGGAGGCACCTGTGTCAACAGGTGGAATATGTATCTGTGTGAGTGX 
CCACTCCGATTCGGCGGGAAGAACTGTGAGCAAGCCATGCCTCACCCCCAGCTCXTCAGCGGTGAGAGCGTCGTG 
TCCTGGAGTGACCTGAACATCATCATCTCTGTGCCCTGGTACCTGGGGCTCATGTTCCGGACCCGGAAGGAGGAC 
AGCGTTCTGATGGAGGCCACCAGTGGTGGGCCCACCAGCTTTCGCCTCCAGATCCTGAACAACTACCTCCAGTTT 
GAGGTGTCCCACGGCCCCTCCGATGTGGAGTCCGTGATGCTGTCCGGGTTGCGGGTGACCGACGGGGAGTGGCAC 
CACCTGCTGATCGAGCTGAAGAATGTTAAGGAGGACAGTGAGATGAAGCACCTGGTCACCATGACCTTGGACTAT 
GGGATGGACCAGAACAAGGCAGATATCGGGGGCATGCTTCCCGGGCTGACGGTAAGGAGCGTGGTGGTCGGAGGC 
GCCTCTGAAGACAAGGTCTCCGTGCGCCGTGGATTCCGAGGCTGCATGCAGGGAGTGAGGAXGGGGGGGACGCCC 
ACCAACGTCGCCACCCTGAACATGAACAACGCACTCAAGGTCAGGGTGAAGGACGGCTGTGATGTGGACGACCCC 
TGTACCTCGAGCCCCTGTCCCCCCAATAGCCGCTGCCACGACGCCTGGGAGGACTACAGCTGCGTCTGTGACAAA 
GGGTACCTTGGAAXAAACTGTGTGGATGCCTGTCACCTGAACCCCTGCGAGAACATGGGGGCCTGCGTGCGCTCC 
CCCGGCTCCCCGCAGGGCTACGXGXGCGAGXGXGGGCCCAGXCACXACGGGCCGTACXGXGAGAACAAACTCGAC 
CTTCCGXGCCCCAGAGGCTGGTGGGGGAACCCCGTCTGTGGACCCXGCCACTGTGCCGTCAGCAAAGGCXXXGAX 
CCCGACXGXAAXAAGACCAACGGCCAGXGCCAAXGCAAGGAGAAXXACXACAAGCXCCXAGCCCAGGACACCTGX 
CTGCCCXGCGACXGCXTCCCCCATGGCXCCCACAGCCGCACXXGCGACAXGGCCACCGGGCAGXGXGCCXGCAAG 
CCCGGCGXCATCGGCCGCCAGXGCAACCGCXGCGACAACCCGXXXGCCGAGGXCACCACGCTCGGCTGTGAAGXG 
AXCTACAAXGGCTGTCCCAAAGCAXXXGAGGCCGGCAXCXGGTGGCCACAGACCAAGXXCGGGCAGCCGGCTGCG 
GXGCCAXGCCCXAAGGGAXCCGTXGGAAAXGCGGXCCGACACXGCAGCGGGGAGAAGGGCXGGCXGCCCCCAGAG 
CTCTTTAACTGTACCACCATCTCCTTCGTGGACCTCAGGGCCATGAATGAGAAGCTGAGCCGCAATGAGACGCAG 
GXGGACGGCGCCAGGGCCCXGCAGCXGGXGAGGGCGCXGCGCAGXGCXACACAGCACACGGGCACGCXCXXXGGC 
AATGACGXGCGCACGGCCXACCAGCXGCXGGGCCACGXCCXXCAGCACGAGAGCXGGCAGCAGGGCTTCGACCTG 
GCAGCCACGCAGGACGCCGACXTXCACGAGGACGXCAXCCACXCGGGCAGCGCCCXCCXGGCCCCAGCCACCAGG 
GCGGCGXGGGAGCAGAXCCAGCGGAGCGAGGGCGGCACGGCACAGCTGCXCCGGCGCCXCGAGGGCXACXXCAGC 
AACGXGGCACGCAACGXGCGGCGGACGTACCTGCGGCCCXXCGXCAXCGTCACCGCCAACAXGAXXCXXGCTGTC 
GACAXCXTXGACAAGXXCAACXXXACGGGAGCCAGGGXCCCGCGAXXCGACACCAXCCAXGAAGAGXXCCCCAGG 
GAGCXGGAGXCCXCCGXCTCCXXCCCAGCCGACTXCTXCAGACCACCTGAAGAAAAAGAAGGCCCCCXGCTGAGG 
CCGGCXGGCCGGAGGACCACCCCGCAGACCACGCGCCCGGGGCCXGGCACCGAGAGGGAGGCCCCGAXCAGCAGG 
CGGAGGCGACACCCXGAXGACGCXGGCCAGXXCGCCGXCGCXCXGGXCAXCAXXXACCGCACCCXGGGGCAGCXC 
CXGCCCGAGCGCTACGACCCCGACCGXCGCAGCCTCCGGXXGCCTCACCGGCCCATCATXAATACCCCGAXGGXG 
AGCACGCXGGXGXACAGCGAGGGGGCXCCGCXCCCGAGACCCCXGGAGAGGCCCGXCCXGGTGGAGXXCGCCCXG 
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CTGGAGGTGGAGGAGCGAACCAAGCCTGTCTGCGTGTTCTGGAACCACTCCCTGGCCGTTGGTGGGACGGGAGGG 
TGGTCTGCCCGGGGCTGCGAGCTCCTGTCCAGGAACCGGACACATGTCGCCTGCCAGTGCAGCCACACAGCCAGC 
TTTGCGGTGCTCATGGATATCTCCAGGCGTGAGAACGGGGAGGTCCTGCCTCTGAAGATTGTCACCTATGCCGCT 
GTGTCCTTGTCACTGGCAGCCCTGCTGGXGGCCTTCGTCCTCCTGAGCCTGGTCCGCATGCTGCGCTCCAACCTG 
CACAGCATTCACAAGCACCTCGCCGTGGCGCTCTTCCTCTCTCAGCTGGTGTTCGTGATTGGGATCAACCAGACG 
GAAAACCCGTTTCTGTGCACAGTGGTTGCCATCCTCCTCCACTACATCTACATGAGCACCTTTGCCTGGACCCTC 
GTGGAGAGCCTGCATGTCTACCGCATGCTGACCGAGGTGCGCAACATCGACACGGGGCCCATGCGGXTCTACTAC 
GTCGTGGGCTGGGGCATCCCGGCCATTGTCACAGGACTGGCGGTCGGCCTGGACCCCCAGGGCTACGGGAACCCC 
GACTTCTGCTGGCTGTCGCTTCAAGACACCCTGATTTGGAGCTTTGCGGGGCCCATCGGAGCTGTTATAATCATC 
AACACAGTCACTTCTGTCCTATCTGCAAAGGTXTCCTGCCAAAGAAAGCACCATTATTATGGGAAAAAAGGGATC 
GTCTCCCTGCTGAGGACCGCATTCCTCCTGCTGCTGCTCATCAGCGCCACCTGGCTGCTGGGGCTGCTGGCTGTG 
AACCGCGATGCACTGAGCTTTCACTACCTCTTCGCCATCTTCAGCGGCTTACAGGGCCCCTTCGTCCTCCTTTTC 
CACTGCGTGCTCAACCAGGAGGTCCGGAAGCACCTGAAGGGCGTGCTCGGCGGGAGGAAGCTGCACCTGGAGGAC 
TCCGCCACCACCAGGGCCACCCTGCTGACGCGCTCCCTCAACTGCAACACCACCTTCGGTGACGGGCCTGACATG 
CTGCGCACAGACTTGGGCGAGTCCACCGCCTCGCTGGACAGCATCGTCAGGGATGAAGGGATCCAGAAGCTCGGC 
GTGTCCTCTGGGCTGGTGAGGGGCAGCCACGGAGAGCCAGACGCGTCCCTCATGCCCAGGAGCTGCAAGGATCCC 
CCTGGCCACGATTCCGACTCAGATAGCGAGCTGTCCCTGGATGAGCAGAGCAGCTCTTACGCCTCCTCACACTCG 
TCAGACAGCGAGGACGATGGGGTGGGAGCTGAGGAAAAATGGGACCCGGCCAGGGGCGCCGTCCACAGCACCCCC 
AAAGGGGACGCTGTGGCCAACCACGTTCCGGCCGGCTGGCCCGACCAGAGCCTGGCTGAGAGTGACAGTGAGGAC 
CCCAGCGGCAAGCCCCGCCTGAAGGTGGAGACCAAGGTCAGCGTGGAGCTGCACCGCGAGGAGCAGGGCAGTCAC 
CGTGGAGAGTACCCCCCGGACCAGGAGAGCGGGGGCGCAGCCAGGCTTGCTAGCAGCCAGCCCCCAGAGCAGAGG 
AAAGGCATCTTGAAAAATAAAGTCACCTACCCGCCGCCGCTGACGCTGACGGAGCAGACGCTGAAGGGCCGGCTC 
CGGGAGAAGCTGGCCGACTGTGAGCAGAGCCCCACATCCTCGCGCACGTCTTCCCTGGGCTCTGGCGGCCCCGAC 
TGCGCCATCACAGTCAAGAGCCCTGGGAGGGAGCCGGGGCGTGACCACCTCAACGGGGTGGCCATGAATGTGCGC 
ACTGGGAGCGCCCAGGCCGATGGCTCCGACTCTGAGAAACCGTGAGGCAAGCCCGTCACCCCACACAGGCTGCGG 
CATCACCCTCAGACCTTGGAGCCCAAGGGGCCACTGCCCTTGAAGTGGAGTGGGCCCAGAGTGTGGCGGTCCCCA 
TGGTGGCAGCCCCCCGACTGATCATCCAGACACAAAGGTCTTGGTTCTCCCAGGAGCTCAGGGCCTGTCAGACCT 
GGTGACAAGTGCCAAAGGCCACAGGCATGAGGGAGGCGTGGACCACTGGGCCAGCACCGCTGAGTCCTAAGACTG 
CAGTCAAAGCCAGAACTGAGAGGGGACCCCAGACTGGGCCCAGAGGCTGGCCAGAGTTCAGGAACGCCGGGCACA 
GACCAAAGACCGCGGTCCAGCCCCGCCCAGGCGGGCATCTCATGGCAGTGCGGACCCGTGGCTGGCAGCCCGGGC 
AGTCCTTTGCAAAGGCACCCCTTGTCTTAAAATCACTTCGCTATGTGGGAAAGGTGGAGATACTTTTATATATTT 
GTATGGGACTCTGAGGAGGTGCAACCTGTATATATAXTGCATTCGTGCTGACTTTGTTATCCCGAGAGATCCATG 
CAATGAXCXCTXGCXGTCTXCTCXGXCAAGATXGCACAGTXGXACTTGAATCXGGCAXGXGXXGACGAAACXGGX 
GCCCCAGCAGAXCAAAGGXGGGAAATACGXCAGCAGXGGGGCTAAAACCAAGCGGCTAGAAGCCCTACAGCTGCC 
XTCGGCCAGGAAGTGAGGAXGGXGTGGGCCCXCCCCGCCGGCCCCCTGGGXCCCCAGTGXXCGCXGXGXGXGCGT 
XXGXCCTCXGCXGCCATCTGCCCCGGCTGTGTGAATTCAAGACAGGGCAGTGCAGCACXAGGCAGGTGXGAGGAG 
CCCTGCTGAGGTCACTGXGGGGCACGGTTGCCACACGGCTGTCATTTTTCACCTGGTCATXCTGTGACCACCACC 
CCCTCCCCXCACCGCCXCCCAGGXGGCCCGGGAGCXGCAGGXGGGGATGGCTTTGXCCXXXGCXCCXGCTCCCCG 
XGGGACCXGGGACCXXAAAGCGXTGCAGGTTCCTGAXXTGGACAGAGGTGTGGGGCCTTCCAGGCCGTTACATAC 
CXCCTGCCAAXXCTCTAACXCTCTGAGACXGCGAGGATCXCCAGGCAGGGTTCTCCCCTCXGGAGTCTGACCAAT 
TACTTCAXTTTGCTXCAAAXGGCCAATTGTGCAGAGGGACAAAGCCACAGCCACACXCXTCAACGGXXACCAAAC 
XGTTXXXGGAAATXCACACCAAGGXCGGGCCCACXGCAGGCAGCXGGCACAGCGXGGCCCGAGGGGCTGTGGAAC 
GGGTCCCGGAACXGTCAGACATGTXXGATXXXAGCGTXTCCXTXGXTCXTCAAAXCAGGTGCCCAAATAAGXGAT 
CAGCACAGCXGCTXCCAAAXAGGAGAAACCATAAAATAGGAXGAAAAXCAAGTAAAATGCAAAGATGTCCACACX 
GXXXXAAACXTGACCCXGATGAAAATGTGAGCACXGXXAGCAGAXGCCTATGGGAGAGGAAAAGCGTAXCTGAAA 
AXGGXCCAGGACAGGAGGATGAAAXGAGAXCCCAGAGTCCXCACACCXGAATGAATXATACAXGXGCCTTACCAG 
GXGAGXGGXCXXTCGAAGATAAAAAACTCXAGTCCCTXXAAACGXTTGCCCCTGGCGTTTCCTAAGXACGAAAAG 
GXXXTXAAGXCXXCGAACAGXCXCCXXTCAXGACXXTAACAGGAXXCXGCCCCCXGAGGXGTAATTXXXXXGXTC 
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TATTTTTTTCCACGTACTCCACAGCCAACATCACGAGGTGTAATTTTTAATTTGATCAGAACTGTTACCAAAAAA 
CAACTGTCAGTTTTATTGAGATGGGAAAAATGTAAACCTATTTTTATTACTTAAGACTTTATGGGAGAGATTAGA 

CACTGGAGGTTTTTAACAGAACGTGTATTTATTAATGTTCAAAACACTGGAATTACAAATGAGAAGAGTCTACAA 
TAAATTAAGATTXTTGAATTTGTACTTCTGCGGXGCTGGTTTTTCTCCACAAACACCCCCGCCCCTCCCCATGCC 
CAGGGTGGCCGTGGAAGGGACGGTTTACGGACGTGCAGCTGAGCTGTCCGTGTCCCATGCTCCCTCAGCCAGTGG 
AACGTGCCGGAACTTTTTGTCCATTCCCTAGTAGGCCTGCCACAGCCTAGATGGGCAGTTTTTGTCTTTCACCAA 
ATTTGAGGACTTTTTTTTTTTGCCATTATTTCTTCAGTTTTCTTTTCTTGCACTGATCTTTCTCCTCTCCTTCTG 
TGACTCCAGTGACTCAGACGTTAGACCTCTTGATGTTTTCCCACTGGTCCCTGAGGCTCTGTTC 
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MAPPPPPVLPVLLLLAAAAALPAMGLRAAAWEPRVPGGTRAFALRPGCTYAVGAACTPRAPRELLDVGRDGRLAG 
RRRVSGAGRPLPLQVRLVARSAPTALSRRLRARTHLPGCGARARLCGTGARLCGALCFPVPGGCAAAQHSALAAP 
TTLPACRCPPRPRPRCPGRPICLPPGGSVRLRLLCALRRAAGAVRVGLALEAATAGTPSASPSPSPPLPPNLPEA 
RAGPARRARRGTSGRGSLKFPMPNYQVALFENEPAGTLILQLHAHYTIEGEEERVSYYMEGLFDERSRGYFRIDS 
ATGAVSTDSVLDRETKETHVLRVKAVDYSTPPRSATTYITVLVKDTNDHSPVFEQSEYRERVRENLEVGYEVLTI 
RASDRDSPINANLRYRVLGGAWDVFQLNESSGWSTRAVLDREEAAEYQLLVEANDQGRNPGPLSATATVYIEVE 
DENDNYPQFSEQNYWQVPEDVGLNTAVLRVQATDRDQGQNAAIHYSILSGNVAGQFYLHSLSGILDVINPLDFE 
DVQKYSLSIKAQDGGRPPLINSSGWSVQVLDVNDNEPIFVSSPFQATVLENVPLGYPWHIQAVDADSGENARL 
HYRLVDTASTFLGGGSAGPKNPAPTPDFPFQIHNSSGWITVCAELDREEVEHYSFGVEAVDHGSPPMSSSTSVSI 
TVLDVNDNDPVFTQPTYELRLNEDAAVGSSVLTLQARDRDANSVITYQLTGGNTRNRFALSSQRGGGLITLALPL 
DYKQEQQYVLAVTASDGTRSHTAHVLINVTDANTHRPVFQSSHYTVSVSEDRPVGTSIATLSANDEDTGENARIT 
YVIQDPVPQFRIDPDSGTMYTiy^ELDYENQVAYTLTIMAQDNGIPQKSDTTTLEILILDANDNAPQFLWDFYQGS 
IFEDAPPSTSILQVSATDRDSGPNGRLLYTFQGGDDGDGDFYIEPTSGVIRTQRRLDRENVAVYNLWALAVDRGS 
PTPLSASVEIQVTILDINDNAPMFEKDELELFVEENNPVGSWAKIRANDPDEGPNAQIMYQIVEGDMRHFFQLD 
LLNGDLRAMVELDFEVRREYVLWQATSAPLVSRATVHILLVDQNDNPPVLPDFQILFNNYVTNKSNSFPTGVIG 
CIPAHDPDVSDSLNYTFVQGNELRLLLLDPATGELQLSRDLDNNRPLEALMEVSVSDGIHSVTAFCTLRVTIITD 
DMLTNSITVRLENMSQEKFLSPLLALFVEGVAAVLSTTKDDVFVFNVQNDTDVSSNILNVTFSALLPGGVRGQFF 
PSEDLQEQIYLNRTLLTTISTQRVLPFDDNICLREPCENYMKCVSVLRFDSSAPFLSSTTVLFRPIHPINGLRCR 
CPPGFTGDYCETEIDLCYSDPCGANGRCRSREGGYTCECFEDFTGEHCEVDARSGRCANGVCKNGGTCVNLLIGG 
FHCVCPPGEYERPYCEVTTRSFPPQSFVTFRGLRQRFHFTISLTFATQERNGLLLYNGRFNEKHDFIAIiEIVDEQ 
VQLTFSAGETTTTVAPKVPSGVSDGRWHSVQVQYYNKPNIGHLGLPHGPSGEKMAWTVDDCDTTMAVRFGKDIG 
NYSCAAQGTQTGSKKSLDLTGPLLLGGVPNLPEDFPVHNRQFVGCMRNLSVDGKNVDMAGFIANNGTREGCAARR 
NFCDGRRCQNGGTCVNRWNMYLCECPLRFGGKNCEQAMPHPQLFSGESWSWSDLNIIISVPWYLGLMFRTRKED 
SVLMEATSGGPTSFRLQILNNYLQFEVSHGPSDVESVMLSGLRVTDGEWHHLLIELKNVKEDSEMKHLVTMTLDY 
GMDQNKADIGGMLPGLTVRSVWGGASEDKVSVRRGFRGCMQGVRMGGTPTNVATLNMNNALKVRVKDGCDVDDP 
CTSSPCPPNSRCHDAWEDYSCVCDKGYLGINCVDACHLNPCENMGACVRSPGSPQGYVCECGPSHYGPYCENKLD 
LPCPRGWWGNPVCGPCHCAVSKGFDPDCNKTNGQCQCKENYYKLLAQDTC 
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GGCACGAGGGCCCGGCGGCAGGTCCCAGCCCGGGGCTAGAGACCGAGGGCCGGGGTCCGGGCCCGGCGGCGGGAC 
CCAGGCGGTTGAGGCTGGTCAGGAGTCAGCCAGCCTGAAAGAGCAGGMOSATCTTGATGTGGTTAACATGTTTG 
TGATTGCGGGCGGCACGCTGGCCATCCCAATCCTGGCATTTGTGGCTTCATTTCTTCTGTGGCCTTCAGCACTGA 
TAAGAATCTATTATTGGTACTGGCGGAGGACATTGGGCATGCAAGTCCGCTATGTTCACCATGAAGACTATCAGT 
TCTGTTATTCCTTCCGGGGCAGGCCTGGGCACAAACCCTCCATCCTCATGCTCCACGGATTCTCTGCCCACAAGG 
ATATGTGGCTCAGTGTGGTCAAGTTCCTTCCAAAGAACCTGCACTTGGTCTGCGTGGACATGCCAGGACATGAGG 
GCACCACCCGCTCCTCCCTGGAXGACCTGTCCATAGATGGGCAAGTTAAGAGGATACACCAGTTTGTAGAATGCC 
TGAAGCTGAACAAAAAACCTTTCCACCTGGTAGGCACCTCCATGGGTGGCCAGGTGGCTGGGGTGTATGCTGCTT 
ACTACCCATCGGATGTCTCCAGCCTGTGTCTCGTGTGTCCTGCTGGCCTGCAGTACTCAACTGACAATCAATTTG 
TACAACGGCTCAAAGAACTGCAGGGCTCTGCCGCCGTGGAGAAGATTCCCTTGATCCCGTCTACCCCAGAAGAGA 
TGAGTGAAATGCTTCAGCTCTGCTCCTATGTCCGCTTCAAGGTGCCCCAGCAGATCCTGCAAGGCCTTGTCGATG 
TCCGCATCCCTCATAACAACTTCTACCGAAAGTTGTTTTTGGAAATCGTCAGTGAGAAGTCCAGATACTCTCTCC 
ATCAGAACATGGACAAGATCAAGGTTCCGACGCAGAXCATCTGGGGGAAACAAGACCAGGTGCTGGATGTGTCTG 
GGGCAGACATGTTGGCCAAGTCAATTGCCAACTGCCAGGTGGAGCTTCTGGAAAACTGTGGGCACTCAGTAGTGA 
TGGAAAGACCCAGGAAGACAGCCAAGCTCATAATCGACTTTTTAGCTTCTGTGCACAACACAGACAACAACAAGA 
AGCTGGACTGAGGCCCCGACTGCAGCCXGCATTCTGCACACAGCATCXGCTCCCATCCCCCAAGTCTGACGCAGC 
CACCACTCXCAGGGAXCCTGCCCCAAAXGCGGXCGGAGCGCCAGXGACCCXGAGGAAGCCCGXCCCXXAXCCCXG 
GTAXCCACGGXXCCCCAGAGCXXXGGGGACCACGCGAAAACCTCCAAGAXAXXXXTCACAAAAXAGAAACXCAXA 
TGGAACAAAATAAGAAACCCCAGCCATGAAAXCXACCAXGAAGXCXXCAAGXXCAXGXCACTGAGAAGCXXGXGC 
AAAGCAGCCACCXTGGACCAXAAXXAAAXCAAGGACAXTXXCXXXGAGACAXXCCXXAXAGTXGGAGACXCAAGA 
XAXTTXXGXXGCAXCAGGXGXAXXCCCXTGCAXGGGCAGXGGCTTXXAXAGGAGCATXAGXCCXCATXCGCXGAA 
CCCTGXXGTTTAGGXCXAAXXTAAGTXXXACAXAGAGACCCATGXAXGACTGCAGCCCATTGGCTGCAAGACCAG 
GGAGGAAAGTGGCAAGCXGXAGAAAAXGXXXACACGCAXGGAGGGGCAXXGCTCXAGCCCXCAGAGCGXCCGGAG 
CAGCAGGGXACATGGGXGGGAGGTTCAXXCAGCACCCACCAGXCAGGXAXGTTCXGAGXGAACCCACAGCAGXCG 
CAGAATGAGCACCTGGCAGGGXGGGXTTCCTAGGAAXAAXTTATTAXXXTXAAAAATAGGCCTAAXAAAGCAAXA 
AXGXXCXAGACAXCXGXCTAAGXAAXCAGACXCAGGXXCCACACACAAGCAACAACXCGXGGGCCXCXTTXCXAX 
TTCAAXGXGCXACXAAGAACCCXXGGAXGXAACAXACXAGXXAGXXAAXGAAXXCXGXGAAXXCTGXGAAGAGXA 
ATGTGAXXGAAAAXAAGXCXAAACAGCXGXAAAAGXGACCACAAXGACAXGAAAXAAAXXXAATAAGXCXAGAXC 
AGCAAAAAAAAAAAAAAAAAA 
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FIGURE 56 

MDLDWNMFVIAGGTLAIPILAFVASFLLWPSALIRIYYWYWRRTLGMQVRYVHHEDYQFCYSFRGRPGHKPSIL 
MLHGFSAHKDMWLSWKFLPKNLHLVCVDMPGHEGTTRSSLDDLSIDGQVKRIHQFVECLKLNKKPFHLVGTSMG 
GQVAGVYAAYYPSDVSSLCLVCPAGLQYSTDNQFVQRLKELQGSAAVEKIPLIPSTPEEMSEMLQLCSYVRFKVP 
QQILQGLVDVRIPHNNFYRKLFLEIVSEKSRYSLHQNMDKIKVPTQIIWGKQDQVLDVSGADMLAKSIANCQVEL 
LENCGHSWMERPRKTAKLIIDFLASVHNTDNNKKLD 
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FIGURE 57A 

GGAACACAAAACTGCGGCCCTTGTCTGCAAACAGTGGTATCGACTTATCAAAGGTGTAGCCCATCAGTGTTATCA 
TGGTTTCATGAAGGCTGTCCAGGAAGGAAACATTCAGTGGGAGAGCCGTACCTATCCTTATCCTGGAACCCCAAT 
CACTCAGCGCTTCTCGCACAGTGCATGCTATTATGATGCTAATCAGTCTATGTATGTGTTTGGAGGCTGTACCCA 
GAGCAGCTGCAATGCTGCTTTCAATGACCTCTGGAGACXTGACCTAAACAGCAAAGAGTGGATCCGACCTTTGGC 
TTCAGGGTCCTATCCTTCCCCCAAAGCTGGAGCAACTCTGGTCGTGTACAAGGACTTGCTAGTGCTGTTTGGTGG 
CTGGACGCGGCCAAGCCCTTATCCCCTACACCAGCCAGAGAGATTCTTTGATGAAATACACACTTACTCACCCTC 
TAAAAATTGGTGGAACTGCATTGXGACAACCCATGGGCCACCTCCCATGGCTGGCCACTCCTCCTGTGTGATAGA 
TGATAAAATGAXTGTCTTTGGTGGCTCTTTAGGATCCCGGCAAATGAGCAATGATGTCTGGGTCCTTGACCTTGA 
GCAGTGGGCGTGGTCCAAGCCGAACATCTCTGGCCCCAGTCCTCATCCTCGAGGTGGCCAATCTCAGATTGTCAT 
AGATGATGCAACTATCTTAATCCTCGGAGGGTGTGGCGGTCCCAATGCTCTAXTCAAGGATGCTTGGTTGTTGCA 
CATGCATXCXGGXCCTTGGGCCTGGCAGCCACXCAAGGTAGAAAAXGAAGAGCAXGGGGCCCCAGAACXGTGGTG 
CCATCCAGCTTGCCGGGTGGGACAGXGTGTGGTGGTCXTCAGCCAGGCTCCTAGTGGGAGAGCCCCACTCAGCCC 
CAGXXXGAACXCXCGCCCATCACCTATCAGTGCCACXCCXCCAGCXCTCGTTCCXGAAACCCGAGAGXACCGCXC 
TCAGTCXCCAGXAAGAAGCAXGGAXGAAGCTCCXTGTGXXAACGGCCGCTGGGGAACACXGAGACCCAGGGCTCA 
AAGGCAGACXCCXTCAGGTTCCCGGGAAGGGAGCCTTTCCCCAGCCAGAGGAGACGGCTCXCCTATCCTCAATGG 
TGGGAGTTTGTCTCCAGGAACGGCAGCXGXGGGXGGCXCXXCTTXGGACAGXCCXGXACAGGCCAXAXCTCCAAG 
XACTCCATCXGCXCCTGAAGGATACGACCTGAAAAXAGGACXTXCTXTGGCCCCCCGACGAGGATCACTACCAGA 
TCAGAAAGAXCXGAGAXTAGGATCCATAGAXCTGAATTGGGATCXGAAACCCGCXTCCAGXAGXAATCCCAXGGA 
XGGCATGGACAAXAGGACAGTTGGGGGAAGTATGAGACACCCTCCXGAACAGACAAATGGTGTGCATACCCCACC 
XCACGXGGCCAGXGCCCXXGCAGGGGCCGTCTCCCCAGGTGCCCXGCGXCGGAGXCXGGAAGCCAXCAAAGCGAX 
GTCCTCCAAAGGCCCCTCGGCCTCTGCAGCACXAAGXCCXCCTCTXGGGXCXTCTCCAGGCXCXCCTGGGAGCCA 
GAGXXTGAGCAGTGGAGAAACAGXGCCCATCCCTCGCCCAGGGCCTGCCCAAGGAGATGGACAXXCCTTACCTCC 
CAXXGCXCGCCGCCXGGGCCACCACCCTCCACAGXCCCTAAAXGXTGGCAAACCCCTATACCAGAGXATGAACXG 
CAAGCCCATGCAGAXGXACGXGCTGGACAXTAAAGACACCAAGGAGAAGGGGCGGGXCAAATGGAAAGTAXXTAA 
XAGCAGTTCTGTGGTXGGACCTCCXGAAACCAGCCXGCATACCGXGGTACAAGGCAGGGGTGAACXCATCATATX 
XGGAGGACXCAXGGACAAGAAACAGAATGXGAAGTACXATCCAAAAACAAACGCCTTGXACXXTGTACGAGCAAA 
GAGATAATGTGXXCXAAACCCCTTTCCXXXXCTGXGGCXTTTAATXXGGAATTTXCCAGTGXGTAAGCATTTGGA 
CTGAGAATTGGGAAAACAAAATTACXCCCAGAAGCCAAAACTCXTXAAXTCCCAACCGAAGTCACXCCAGGCTGG 
GAXCAAAXCXCCATXAAGAAAAAAAATTAXAXATAAAXAXAXAXAXAXAXAXTATATAGCCAACTCTGTTGACAA 
AAAAAGGGAGAGATXXCCAXCCXGGXTCAGAXAAAGTXGXTGCTGXGXXXTAACAGGGGCXGGGCXGCCXXXTXC 
XACCTXGCTGGTAACXAGACCAAGAAGTTAGAGAAXAGACTAACATCAGTAACTTCCCAAAAGAAACTGAAGAGC 
CCCCXGXAAAXCTTTATGXGGCCTTCXXGGAGTXAAAAAAXGAAAGGGCATAXGTAAGXTGCAAAGGXGGAGGGT 
TXTAGACXCXCATGCXXCAGGTGCTGTCGGGGTAAAAGXAACXGTXTTXCCCCTTCTCXXAAAACCACAGAGGAC 
CXGTGACAGCTCXGCAGAAATGCCAGTGCCTGGCCCCCTCTTGCCTXTTATGGCXGAGGAAAGXXACCCAACAAA 
GGATXXXATXCCACAXXXGXGXGCCGGGTCAXXGXGAAATAAXGXXXATGCAGCCAACAXCTGACCGCCTAGXAG 
XGXCCATTGGXCTXXGGAGTGCTXCTTGXGXGXCXCAGAAAACATXTXGXGXCTGATXGTGGAAAXTTCXGACAA 
XCAATCAXAXXXGGXGGCAAGXXGCCAAAAAACAXATTTAXXCTCCXCTTXCTXCCCXXAGAACATGGXACXTGG 
AAACTGCCXXXTCCATTCACXXACAGGCXGCTTTTTTCXXCTACCXXTXXXCXXXXCXXXXXXXTTTTXCAXATG 
XGGTGAAGXCCXAAAACCXGTCTGACTTTCTGTTTCTTTTACAAAGCTCAGGTGCCXACAGAAAXCAGAAGCTGC 
XXAGAAATACXTGXGGAACCCXXXCCCTGGXGTCACXAGGGGGCCAGTAGGGAATTCXAAGAXGCCAATAXXGXG 
AGAAAXCXXXGAAGCAAGCATCAAAAGAXACXGTTTXTTCCCXAXGGGCXXCXTXTXACTTCCAAAGCACAXXGA 
GCACACXCATCCCATAXXTGTAGAAXGTGGAAAXXGATXCXGGAAGGGAAXXCCAAXAACAGXXCXTTXXAGAAA 
XGTXXXTTTCCTTGTGGTGACAXATATTCCXCTTCCTGGCTATTGGCAGGXGATGGAGTTGAGAATCAXGTACXX 
GACTTCTTGACGCAXGGCTGACCXCAGAAACAGCTCCAXCCTTXGCACTXGXCTTCXTCAXGXGTCACCCAAATA 
GGGCXXGGGXXXTXACXXXCACTXCATXXCXGAGAXXAAGGTGXAACCAAGTAGAGCAXTXCXXXGCXXGAXACA 
GAAAGXTACXAGTCXCAACCATGGCCXGGGCAXAGGAGAXGTCAAAAXAAGTTTAXTXCAAAXGGCACCATXTAA 
AXAGGGATXTXXGGXGAXXXCXCAXCAGXGGAAAGAGGAXXTTXGXXXTGCCXTXGXTXXGXXTTGAAAXXTAAA 
XCCXAAXXXCAAAACAXCACCXXGCCACCCXGACACTCCXCXXXXXAXXAXXAGCGXXXCXCAGGCACAAAGCCX 
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FIGURE STB 

GCTGCAGCTGGCCCTGGGTCCTGGCTITCAGCCAGCATCTGGCAGCCTAAGTGTACTGATAAGTGTGTTTTCTCC 
TGTTACATCATGCTGAATCCTTTCCCTTAGCCATTAGCTTTTATGATGTGGTCTTCGTAGGAAAGCCACCCTGGT 
GCCAAGCCTAGCTTGTGGGGAGGGGTATGTGTTCCAGAAACTGCTCTTTGTGTTCCCTTCAATGAGGAAACAACA 
TGTGTCTACTTATGTGGCATCCAACTGCTTGGAGCTCCACACTTCCCTTTCGCGACTCAGGCTCTGGTGCTGTTG 
CCAATCCTTGCTTGGCAAAGACTGTTCGATCATGTGGGGTCCTTATTTACAAGGGAAAGCTGGGCCAGAAGGCTA 
GCAATTCAGGTGTTACCGCTATTGCTGTACCTTGTGTTAGGACATTGTGTTTGTGCATGGACTGTGCCTCCAAAC 
TCAGTAGTTCCTATCTAAATATAAAGTATATTAGAAACCTGAAAGTACAGAATCTCAACCTTACAGTCTTTCCCT 
TAGTCCTGTGGCCTTCCTAAGCCAGCTGTTAACGTGTTGATTCCTTCCACTTCCCCCAAGTAGGCAGGCAACAGA 
TATGTTGATTGTCTTAGAAAGTAATCTGGTTCCTCTGAACTCCATTGAATTCCAGTTTGACGCATACTGCCTGGA 
ACCAGACTGTTTGCTTACAGCTTTTTAAGAAAl^TCTGCCTTGTCTGCCCCCATTATGGTTGGTCTTGGTAGCTC 
CTGGGCACTGTGGGCGTGTACCATGGGAAAGTGATTCAACACAGTGAAGGTGATTGTCTCCTCAGGCCTCCTGAA 
GCCACCTGTGCGGTGGGACTTCACGTGTCTCGGCCAAGGCGAGCATTTCCACAATGCCGTGGATGCTGCAGTCAG 
GCCAGATTGAACCATGACCCCATTTTTCACATGATACCAATTTTGTCTTAAAATTCACTGAGAAAAATGAGACTA 
AATTTTTTTTAACCCTCAGGAGCACCTGAAGCAAATATTTATCCGTATTTATXGAAAATTCATTTTGTTTCTACT 
TGAAGCTTTAACCCCTTCCATTCCTGCAAGTGTGCCTTTTGAGAGCTCCCATGGCCTAGTGACTTCACTGGTCAC 
CTTGTCCATCTTACTTAAGAATTTTAGTCTCTCCCTACCCTCTTGGACAGAGCTTCCTGTTCTCTTATTXCACAG 
GTTATACAGCAGAGCGGGTTTTGTGTTTTCATTCCTATTTCCACCCTTCATTTGGTTGGTGGACTCCACAACCTC 
ACCCTACCCTACACTTTGGGAGCACAAATTTGGTGTTGAAACAAGCTTAAATTTCATTTTAGGGCATACTGGGCT 
TACTCTCCTCCCAGCTGTCTGTGGATTGATTTGATTTTAATGTTCGAGTTTTACAGCAACAGCTGAAAACCATGA 
ACTATTCTAGGAACTGXGTTGGAACXCTTTAAAATAAAGAAAAGAGGAGGAGGAGGAGGAAGAAAGAAAACCAAC 
XXAAGAAGCCXXGACTXXGGAGGACAGAAAGCCACCAGCCAAXGGAGAACAAAGAGATGXXXCCCXTTCCXXXCX 
XXCACCXXGTCAXXCXGGGXXTCCXXCXGCXXCACXCXXTCCTXCCCCCTXAAAAGXGGXAXTCCTGGTTGGTCX 
GXCXGXCXGXCCXXGXCCXXGXGGXGATCCXGGCAXGGXGAXAXGCXCCACXXTGCAXXAXCCAXGGXCTCXXAC 
CAGCGCACAAGXCAGXGGGGAGGAXCTAACCACGCCXGGXGGTGAGGAAGCTGAAXXXCCAGGCCXGCGXCCCAT 
GXAGCCXCXCCAXGAACXGCAGAAGGCAXGXXCXGCAXGGXXACCAGXAAGTGGCTCCCXCTCACCGXGXXCATT 
GXCAAATGAGAGCAAACXXTAGGXGXTGGCXCCAXXGXACACTCXACTXGCXCTGCXCCCCXCCCXCCAACCAGG 
GXXCAXGXCAGTGCACACCCCATGXGCCCXGGCGAAGCXGGXGCXGXGAGXGATGXXXCCCAXACAACXCAGGGA 
XGCCAGGXGGCTTACCCXGAGATAGTCAXXXTGGGCACATAACAGTGXAGGAATGAAACAXGGATTXCAXXGATA 
XXXAAAXCXGXCAAXXXCAXXXXXXGXXAAXGXXXXCCCCXGAXGACXXXXXAGCAAXXXAACAAAXAAAAXGGA 
CAAXXGXCXXAAC 
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FIGURE 58 

EHKTAALVCKQWYRLIKGVAHQCYHGFMKAVQEGNIQWESRTYPYPGTPITQRFSHSACYYDANQSMYVFGGCTQ 
SSCNAAFNDLWRLDLNSKEWIRPLASGSYPSPKAGATLWYKDLLVLFGGWTRPSPYPLHQPERFFDEIHTYSPS 
KNWWNCIVTTHGPPPMAGHSSCVIDDKMIVFGGSLGSRQMSNDVWVLDLEQWAWSKPNISGPSPHPRGGQSQIVI 
DDATILILGGCGGPNALFKDAWLLHMHSGPWAWQPLKVENEEHGAPELWCHPACRVGQCWVFSQAPSGRAPLSP 
SLNSRPSPISATPPALVPETREYRSQSPVRSMDEAPCVNGRWGTLRPRAQRQTPSGSREGSLSPARGDGSPILNG 
GSLSPGTAAVGGSSLDSPVQAISPSTPSAPEGYDLKIGLSLAPRRGSLPDQKDLRLGSIDLNWDLKPASSSNPMD 
GMDNRTVGGSMRHPPEQTNGVHTPPHVASALAGAVSPGALRRSLEAIKAMSSKGPSASAALSPPLGSSPGSPGSQ 
SLSSGETVPIPRPGPAQGDGHSLPPIARRLGHHPPQSLNVGKPLYQSMNCKPMQMYVLDIKDTKEKGRVKWKVFN 
SSSWGPPETSLHTWQGRGELIIFGGLMDKKQNVKYYPKTNALYFVRAKR 
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FIGURE 59 

GCTCTGGCGGCTCCCGCGGCTCCGGCIGGCGGCTTCGGGCCCTGCACCTGTGACTCTCGGCCGCGCTCGCCCTCG 
GCCCGCCCGGCGCCGCAGCCCCATGGCCCCGTCCAGGCTGCAGCTCGGCCTCCGCGCCGCCTACTCCGGCATCAG 
CTCCGTGGCCGGCTTCTCCATCTTCCTCGTCTGGACGGTGGTCTACCGACAGCCGGGGACCGCGGCCATGGGAGG 
GCTCGCAGGGGTGCTGGCACTGTGGGTCCTGGTGACGCACGTGATGTACATGCAAGATTATTGGAGGACCTGGCT 
CAAGGGGCTGCGCGGCTTCTTCTTCGTGGGGGTCCTCTTCTCGGCCGTCTCCATCGCTGCCTTCTGCACCTTCCT 
CGTGCTGGCCATCACCCGGCATCAGAGCCTCACAGACCCCACCAGCTACTACCTCTCCAGCGTCTGGAGCTTCAT 
XTCCTXCAAGTGGGCCTTCCTGCTCAGCCTCTATO^CCACCGCTACCGGGCTGACTTTGCTGACATCAGCATCCT 
CAGCGATTTCTGACCCAGGGGGTGAGGTCTCTGCACCCTGGGGGGGCCTTAGGACCTGGACTCAGCCTCTGAGAT 
GTTGGGAGAGGCTACTCCCACCCCCTGGTGACCCCAGAACTGTGGCAGAAAATACACAGCAGGACGAGTGTGGTC 
TCCCAGGAAGCTGTCCTGCCCGTCCCCTTTCGAGGAAACCTGAGTGTGGTAGAGAGGGGATCCTGCCATGTTGCT 
CCTCATCAGCCTGGCCAGAGGGCAGCTTTAGACCTTTTCAAATGAATCTGTTTTCTTTTCTTTCTTTTTTTTTTC 
TTTTTTTTTTTTTTTTGAGATGGAGTCTTACTCTGTCACCCAGGCTGGAGTGCAGTAGTGCGATCTCAGCTCACT 
GCAACCTCCGCCTCCCAGGTTCAAGCAATTCTCCTGCCTTGGCCTCTCAAGTAGCTGGGATTACAGGCATCTGCC 
ACCATGCCCGGCAAATTTTTGTGTTTTTAGTAGAGACAGGGTTTTGCCATGTTGGCCAGGCTGGTCTCGAACTCC 
TGATCTCAGGTGATTCACCCGCCTCAGCCXXCCAAAGTGCTGGGATTATAGGTGTGAGCCACCGTGCCCGGCCTG 
GATCTGTTTXCTTAGCACGCAGXGAGGAAXCXXXGXACXXAAGGCCAGGGCAACAAAGXCAAGAGGXCAAGGTGT 
AGGGCCATGAGGCCTGGACCXAXGCTGCAGGCAAGGGTTXCCATCCCCGCXGCCCXAGGCACXCTCTXCCCAAGG 
CCAGGXXGGGCACCTGGGGAGGXCAGXTCAGAAATATCTAGCAGAGACCTCTTAAACCCCCAXCCCAGCACCCCA 
TCCTGTTGTXCCCAGAGCTGGTCXCCCAXGAGTGXGCXAGAGCCAGAXAGCCGXGGCCCCCCACCCATCXCACTC 
ACACACACAGGCAXCCATACACCCCAGAAGACXTCCCAAAXGAGGCCAGACXCAGGGXCACGGGGAAXGTGCTTC 
XGCCCCXGTAAGGGCXXXGGGGAAGGGGGCAACATAGTAGAGGCXGGAAAGAGCCCCCAAACCTGTGCCCAXGCC 
CCTCCAGCCCXGCGTTTCCATTCXGCCTTCTCAGAGXGCCCTTGCTGCACCCAGACCACCGGCCAGGAGAGACCT 
XCXCTCCCACXCCAGCCCCXCXCACTGCCCTTCAACXAGAGCXXXCACCXXXXXACAXTTCCCXXCTGAAGGACA 
CAAATCTGCTTXXCXGCCCAXACACXGGCCCAAGGGCXCACCXAACTTGGGAGGGAAGGGGCTGXXGGXACAAGG 
ATGATXTTCTGXXAGGCXGCCATXXXGCACGGTCTCCCCCXCCCCAXCXGAXGTGXCCTGCCCCXCAGCTCXTTG 
CCXTAXCXGTGTCACTGTCACXXTAGCAAAAATACAGCGGCCAXXXGXATCAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 60 

MPTATGLTLLTSASSAISDPGGEVSAPWGGLRTWTQPLRCWERLLPPPGDPRTVAENTQQDECGLPGSCPARPLS 
RKPECGREGILPCCSSSAWPEGSFRPFQMNLFSFLSFFFLFFFFLRWSLTLSPRLECSSAISAHCNLRLPGSSNS 
PALASQVAGITGICHHARQIFVFLVETGFCHVGQAGLELLISGDSPASAFQSAGIIGVSHRARPGSVFLARSEES 
L YLRP GQQ S QE VKV 
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FIGURE 61 

ATGGGTGTGCTGGATAGCCGGTTGCACCTGTTTCTTGGGTCTTCTCTCCTGTTCCCACTGTTCCACTTCAGGAGA 
GCCAAGCATCTCACCCATGCTCTCATATTCAGTCTAGGCATCCTACCCAAGTATCCTAATTATAAACTAGCCACC 
AGCTCAGGCTCTGTATCCCAGTCACCACCCCCATTCAGGCCCCCAACTATCCAGTGCCAGAGGCCCTCAGGTGCA 
GAGGGCCAGTTCAGCCAAACCAGGACAGACCCAGCCACCCTTGTTTCCAGCCCAGCCTACAAGCCTCCTGAAACC 
CACCAGTCTGATGGTAGCCTCCCCTCCCTGTTCTCCCAGGAAGAAAGAGAAATCGCCTCCCTCCAAAACCAAGCC 
CAACCAGAAACCTGTCCCCACCCCCTCATCAGCTGCTCCCAGATCCGAATGGGATTTGCAGAAGTCCTGAGCACT 
GGTGGTXGGTGGGCAGCAACAGGCCCTTTGCGCGGCAGCTGGACGGATGGGGTGGGGGCGAGGTCTGGGCCAGGG 
CCTGAGGGTACGGGCACTAAGGCCGAGGCGTCGCCTCCCGGCGCCTCTCCCACGGCCACGGCCGAGCGGGCTCCA 
CGGCTCCAGGAGTTCGCCGCGCTAGCTGCCTCCCCCTGCGTCCGCGTCTTGAGCTCTGAGCTCTTCACCCTGACC 
TATGGTGCCCTGGTCACCCAGCTATGTAAGGACTATGAAAATGATGAAGATGTGAATAAACAGCTGGACAAAATG 
GGCTTTAACATTGGAGTCCGGCTGATTGAAGATTTCTTGGCTCGGTCAAATGTTGGGAGGTGCCATGACTTTCGG 
GAAACTGCGGATGTCATTGCCAAGGTGGCGTTCAAGATGTACTTGGGCATCACTCCAAGCATTACTAATTGGAGC 
CCAGCTGGTGATGAATTCTCCCTCATTTTGGAAAATAACCCCTTGGTGGACTTTGTGGAACTTCCTGATAACCAC 
TCATCCCTTATTTATTCCAATCTCTTGTGTGGGGTGTTGCGGGGAGCTTTGGAGATGGTCCAGATGGCTGTGGAG 
GCCAAGTTTGTCCAGGACACCCTGAAAGGAGACGCCGCGAGGGGCGGCCGGCAGAGCGACGCGGAACCCCGCCGG 
CGCGGACCCCGGACCCCAACGCCGCCCGCCCAGCCGCGGACGCCCCTGCCCGGAGCCCTCGCCGCCCGGGCCGCC 
CTCGAGGGCCGGGAGCGGCGCCCGGCGGCCGCGCCCGCAGGCCCTGCCTCTGCTGGAACCTTCCCAGGCCCCTCC 
GACCTGAAAGCCCAGCCCCTCCTGCTGCCGCTGGTGCCGCCACCACGCAGGGTAGCCGCAGAGGCCCAGGAATCT 
TGGCAGGCGTGGGGAGGCAGCGGGTGGCGGGTGGCGCTCCGGAAAAGGCTGCAAATGCGAACCAGAAGCACGTCC 
ACGGACGCCATGCTGGGGACTCTGACACCCCTGTCTTCGCTGCTGCTGCTGCTACTGGTGCTGGTGCTGGGGTGT 
GGGCCGCGGGCGTCCTCTGGTGGCGGGGCCGGTGGGGCGGCGGGCTATGCCCCAGTGAAGTACATCCAGCCCATG 
CAGAAAGGACCTGTGGGACCGCCCTTCCGTGAGGGCAAAGGCCAGTACCTGGAAATGCCTCTACCGCTGCTGCCG 
ATGGACCTGAAGGGAGAGCCCGGCCCCCCTGGGAAGCCCGGGCCTCGGGGTCCCCCXGGCCCCCCTGGCTTCCCA 
GGAAAACCAGGCATGGGAAAGCCAGGACTCCATGGGCAGCCTGGCCCTGCTGGGCCCCCTGGCTTCTCCCGGATG 
GGCAAGGCTGGTCCCCCAGGGCTCCCTGGCAAGGTCGGGCCACCAGGGCAGCCGGGGCTTCGGGGGGAGCCAGGA 
ATACGAGGGGACCAGGGCCTCCGGGGACCCCCAGGACCCCCTGGCCTCCCGGGCCCCTCAGGCATTACTATCCCT 
GGAAAACCAGGTGCCCAAGGGGTGCCAGGGCCCCCAGGATTCCAGGGGGAACCAGGGCCCCAGGGGGAGCCTGGG 
CCCCCAGGTGATCGAGGCCTCAAGGGGGATAATGGAGTGGGCCAGCCCGGGCTGCCTGGGGCCCCAGGGCAGGGG 
GGTGCCCCCGGCCCCCCCGGCCTCCCTGGTCCAGCTGGCTTAGGCAAACCTGGTTTGGATGGGCTTCCTGGGGCC 
CCAGGAGACAAGGGTGAGTCTGGGCCTCCTGGAGTTCCAGGCCCCAGGGGGGAGCCAGGAGCTGTGGGCCCAAAA 
GGACCTCCTGGAGTAGACGGTGTGGGAGTCCCAGGGGCAGCAGGGTTGCCAGGACCACAGGGCCCATCAGGGGCC 
AAAGGGGAGCCAGGGACCCGGGGCCCCCCTGGGCTGATAGGCCCCACTGGCTATGGGATGCCAGGACTGCCAGGC 
CCCAAGGGGGACAGGGGCCCAGCTGGGGTCCCAGGACTCTTGGGGGACAGGGGTGAGCCAGGGGAGGATGGGGAG 
CCAGGGGAGCAGGGCCCACAGGGTCTTGGGGGTCCCCCTGGACTTCCTGGGTCTGCAGGGCTTCCTGGCAGACGT 
GGGCCCCCTGGGCCTAAGGGTGAGGCAGGGCCTGGAGGACCCCCAGGAGTGCCTGGCATTCGAGGTGACCAGGGG 
CCTAGTGGCCTGGCTGGGAAACCAGGGGTCCCAGGTGAGAGGGGACTTCCTGGGGCCCATGGACCCCCTGGACCA 
ACTGGGCCCAAGGGTGAGCCGGGTTTCACGGGTCGCCCTGGAGGACCAGGGGTGGCAGGAGCCCTGGGGCAGAAA 
GGTGACTTGGGGCTCCCTGGGCAGCCTGGCCTGAGGGGTCCCTCAGGAATCCCAGGACTCCAGGGTCCAGCTGGC 
CCTATTGGGCCCCAAGGCCTGCCGGGCCTGAAGGGGGAACCAGGCCTGCCAGGGCCCCCTGGAGAGGGGAGAGCA 
GGGGAACCTGGCACGGCTGGGCCCACGGGGCCCCCAGGGGTCCCTGGCTCCCCTGGAATCACGGGCCCTCCGGGG 
CCTCCCGGGCCCCCGGGACCCCCTGGTGCCCCTGGGGCCTTCGATGAGACTGGCATCGCAGGCTTGCACCTGCCC 
AACGGCGGTGTGGAGGGTGCCGTGCTGGGCAAGGGGGGCAAGCCACAGTTTGGGCTGGGCGAGCTGTCTGCCCAT 
GCCACACCGGCCTTCACTGCGGTGCTCACCTCGCCCTTCCCCGCCTCGGGCATGCCCGTGAAATTTGACCGGACT 
CTCTACAATGGCCACAGCGGCTACAACCCAGCCACTGGCATCTTCACCTGCCCTGTGGGCGGCGTCTACTACTTT 
GCTTACCATGTGCACGTCAAGGGCACCAACGTGTGGGTGGCCCTGTACAAGAACAACGTGCCGGCCACCTATACC 
TACGATGAGTACAAGAAGGGCTACCTGGACCAGGCATCTGGTGGGGCCGTGCTCCAGCTGCGGCCCAACGACCAG 
GTCTGGGTGCAGATGCCGTCGGACCAGGCCAACGGCCTCTACTCCACGGAGTACATCCACTCCTCCTTTTCAGGA 
TTCTTGCTCTGCCCCACATAA 
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FIGURE 62 

MGVLDSRLHLFLGSSLLFPLFHFRRAKHLTHALIFSLGILPKYPNYKLATSSGSVSQSPPPFRPPTIQCQRPSGA 
EGQFSQTRTDPATLVSSPAYKPPETHQSDGSLPSLFSQEEREIASLQNQAQPETCPHPLISCSQIRMGFAEVLST 
GGWWAATGPLRGSWTDGVGARSGPGPEGTGTKAEASPPGASPTATAERAPRLQEFAALAASPCVRVLSSELFTLT 
YGALVTQLCKDYENDEDVNKQLDKMGFNIGVRLIEDFLARSNVGRCHDFREJTADVIAKVAFKMYLGIXPSITNWS 
PAGDEFSLILENNPLVDFVELPDNHSSLIYSNLLCGVLRGALEMVQMAVEAKFVQDTLKGDAARGGRQSDAEPRR 
RGPRTPTPPAQPRTPLPGALAARAALEGRERRPAAAPAGPASAGTFPGPSDLKAQPLLLPLLPPPRRVAAEAQES 
WQAWGGSGWRVALflKRLQMRTRSTSTDAMLGTLTPliSSLLLLLLVLVLGCGPRASSGGGAGGAAGYAPVKYIQPM 
QKGPVGPPFREGKGQYLEMPLPLLPMDLKGEPGPPGKPGPRGPPGPPGFPGKPGMGKPGLHGQPGPAGPPGFSRM 
GKAGPPGLPGKVGPPGQPGLRGEPGIRGDQGLRGPPGPPGLPGPSGITIPGKPGAQGVPGPPGFQGEPGPQGEPG 
PPGDRGLKGDNGVGQPGLPGAPGQGGAPGPPGLPGPAGLGKPGLDGLPGAPGDKGESGPPGVPGPRGEPGAVGPK 
GPPGVDGVGVPGAAGLPGPQGPSGAKGEPGTRGPPGLIGPTGYGMPGLPGPKGDRGPAGVPGLLGDRGEPGEDGE 
PGEQGPQGLGGPPGLPGSAGLPGRRGPPGPKGEAGPGGPPGVPGIRGDQGPSGLAGKPGVPGERGLPGAHGPPGP 
TGPKGEPGFTGRPGGPGVAGALGQKGDLGLPGQPGLRGPSGIPGLQGPAGPIGPQGLPGLKGEPGLPGPPGEGRA 
GEPGTAGPTGPPGVPGSPGITGPPGPPGPPGPPGAPGAFDETGIAGLHLPNGGVEGAVLGKGGKPQFGLGELSAH 
ATPAFTAVLTSPFPASGMPVKFDRTLYNGHSGYNPATGIFTCPVGGVYYFAYHVHVKGTNVWVALYKNNVPATYT 
YDEYKKGYLDQASGGAVLQLRPNDQVWQMPSDQANGLYSTEYIHSSFSGFLLCPT 
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FIGURE 63 

GTTGCCGCTGCGCACCXGGCTCAGGTGAGCTGCCCCGCCCCCGCCCGGCGCGAGCCCCAGGTCCTGGCAGCAGCC 
CCTGACCTGTCCAGGTGCCCTGTCCAGCTGACTGCAAGGACAGAGAGGAGTCCTGCCCAGCTCTTGGATCAGTCT 
GCTGGCCGAGGAGCCCGGTGGAGCCAGGGGTGACCCTGGAGCCCAGCCTGCCCCGAGGAGGCCCCGGCTCAGAGC 
CATGCCAGGTGTCTGTGATAGGGCCCCTGACTTCCTCTCCCCGTCTGAAGACCAGGTGCTGAGGCCTGCCTTGGG 
CAGCTCAGTGGCTCTGAACXGCACGGCTTGGGTAGTCTCTGGGCCCCACTGCTCCCTGCCTTCAGTCCAGTGGCT 
GAAAGACGGGCTTCCATTGGGAATTGGGGGCCACTACAGCCTCCACGAGTACTCCTGGGTCAAGGCCAACCTGTC 
AGAGGTGCTTGTGTCCAGTGTCCTGGGGGTCAACGTGACCAGCACTGAAGTCTATGGGGCCTTCACCTGCTCCAT 
CCAGAACATCAGCTTCTCCTCCTTCACTCTTCAGAGAGCTGGCCCTACAAGCCACGTGGCTGCGGTGCTGGCCTC 
CCTCCTGGTCCTGCTGGCCCTGCTGCTGGCCGCCCTGCTCTATGTCAAGTGCCGTCTCAACGTGCTGCTCTGGTA 
CCAGGACGCGTATGGGGAGGTGGAGATAAACGACGGGAAGCTCTACGACGCCTACGTCTCCTACAGCGACTGCCC 
CGAGGACCGCAAGTTCGTGAACTTCATCCTAAAGCCGCAGCTGGAGCGGCGTCGGGGCTACAAGCTCTTCCTGGA 
CGACCGCGACCTCCTGCCGCGCGCTGAGCCCTCCGCCGACCTCTTGGTGAACCTGAGCCGCTGCCGACGCCTCAT 
CGTGGTGCTTXCGGACGCCTTCCTGAGCCGGGCCTGGTGCAGCCACAGCTTCCGGGAGGGCCTGTGCCGGCTGCT 
GGAGCTCACCCGCAGACCCATCTTCATCACCTTCGAGGGCCAGAGGCGCGACCCCGCGCACCCGGCGCTCCGCCT 
GCTGCGCCAGCACCGCCACCTGGTGACCTTGCTGCTCTGGAGGCCCGGCTCCGTGACTCCTTCCTCCGATTTTTG 
GAAAGAAGTGCAGCTGGCGCTGCCGCGGAAGGXGCGGTACAGGCCGGTGGAAGGAGACCCCCAGACGCAGCTGCA 
GGACGACAAGGACCCCATGCXGAXXCXXCGAGGCCGAGXCCCXGAGGGCCGGGCCCXGGACXCAGAGGXGGACCC 
GGACCCXGAGGGCGACCXGGGTGTCCGGGGGCCTGTXXXTGGAGAGCCATCAGCTCCACCGCACACCAGTGGGGX 
CXCGCTGGGAGAGAGCCGGAGCAGCGAAGTGGACGXCTCGGAXCXCGGCXCGCGAAACTACAGXGCCCGCACAGA 
CXXCXACXGCCXGGXGTCCAAGGATGAXATGTAGCTCCCACCCCAGAGXGCAGGATCATAGGGACAGCGGGGGCC 
AGGGCAGCGGCGTCGCXCCXCXGCXCAACAGGACCACAACCCCTGCCAGCAGCCCXGGGACCCXGCCAGCAGCCC 
TGGGAAAAGGCXGTGGCCXCAGGGCGCCTCCCAGTGCCAGAAAATAAAGXCCXTTTGGAXXCTGAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 64 

MPGVCDRAPDFLSPSEDQVLRPALGSSVALNCTAWWSGPHCSLPSVQWLKDGLPLGIGGHYSLHEYSWVKANLS 
EVLVSSVLGVNVTSTEVYGAFTCSIQNISFSSFTLQRAGPTSHVAAVLASLLVLLALLLAALLYVKCRLNVLLWY 
QDAYGEVEINDGKLYDAYVSYSDCPEDRKFVNFILKPQLERRRGYKLFLDDRDLLPRAEPSADLLVNLSRCRRLI 
WLSDAFLSRAWCSHSFREGLCRLLELTRRPIFITFEGQRRDPAHPALRLLRQHRHLVTLLLWRPGSVTPSSDFW 
KEVQLALPRKVRYRPVEGDPQTQLQDDKDPMLILRGRVPEGRALDSEVDPDPEGDLGVRGPVFGEPSAPPHTSGV 
SLGESRSSEVDVSDLGSRNYSARTDFYCLVSKDDM 



wo 2004/041170 



PCT/US2003/034312 



74/2825 

FIGURE 65A 

GGAGCTGGGGATCCCCGCTCTCCTGGACCCCAATGACATGGTCTCCATGAGCGTCCCTGACTGCCTCAGCATCAT 
GACCTATGTGTCCCAGTATTAC2\ACCACTTCTGCAGTCCTGGCCAAGCTGGTGTCTCGCCACCCAGAAAGGGCCT 
TGCACCCTGTTCCCCGCCGTCTGTAGCACCCACTCCAGTGGAATCAGAAGATGTGGCTCAGGGCGAGGAGCTCTC 
CTCAGGCAGCCTGTCAGAGCAGGGCACCGGCCAGACCCCCAGCAGCACGTGCGCAGCCTGCCAGCAGCATGTGCA 
CTTGGTGCAGCGCTACCTGGCTGACGGCAGGCTGTACCATCGCCACTGCTTCCGGTGTCGGCGGTGCTCCAGCAC 
CCTGCTCCCTGGGGCTTATGAGAATGGGCCTGAGGAGGGCACCTTTGTGTGTGCAGAACACTGTGCCAGGCTGGG 
CCCGGGGACACGGTCGGGGACCAGGCCTGGGCCCTTCTCACAGCCAAAGCAGCAGCACCAGCAGCAACTCGCAGA 
AGATGCCAAGGATGTTCCAGGAGGCGGCCCCAGCTCCAGTGCTCCTGCAGGGGCTGAGGCCGATGGACCCAAGGC 
CAGCCCTGAGGCCCGGCCGCAGATCCCTACCAAGCCCCGGGTTCCTGGCAAACTACAGGAGCTGGCCAGCCCCCC 
TGCGGGCCGCCCCACCCCTGCCCCCAGGAAGGCCTCTGAGAGCACCACCCCAGCACCCCCCACGCCCCGGCCCCG 
CTCCAGTCTGCAGCAGGAGAACCTGGTGGAGCAGGCTGGCAGCAGCAGCCTGGTGAACGGGAGACTGCACGAACT 
GCCTGTCCCCAAGCCGAGGGGGACACCGAAGCCGTCCGAGGGGACACCAGCCCCCAGGAAGGACCCCCCATGGAT 
CACGCTGGTGCAGGCAGAACCAAAGAAGAAGCCAGCCCCACXTCCCCCAAGCAGCAGCCCGGGGCCACCAAGCCA 
GGACAGCAGGCAGGTGGAGAATGGAGGCACCGAGGAGGTGGCCCAGCCGAGCCCAACGGCCAGCCTGGAGTCCAA 
ACCCTATAACCCCTTTGAGGAGGAGGAGGAGGACAAGGAGGAAGAGGCTCCAGCTGCACCCAGCCTGGCCACCAG 
CCCTGCCCTGGGCCACCCGGAGTCCACACCCAAGTCCCTGCACCCCTGGTACGGCATCACCCCTACCAGCAGCCC 
CAAGACAAAGAAGCGCCCTGCCCCGCGCGCACCCAGCGCGTCCCCACTGGCTCTCCACGCCTCCCGCCTCTCGCA 
CTCGGAGCCGCCCTCGGCCACACCATCGCCAGCGCTCAGCGTGGAGAGCCTGTCGTCTGAGAGCGCCAGCCAGAC 
TGCAGGTGCAGAGCTTCTGGAGCCGCCAGCTGTGCCCAAGAGCTCCTCAGAGCCTGCTGTCCATGCCCCTGGTAC 
CCCTGGAAACCCTGTCAGCCTCTCTACCAACTCCTCCCTGGCCTCCTCTGGGGAACTAGTGGAGCCTAGAGTGGA 
ACAAATGCCTCAAGCCAGCCCTGGCCTTGCCCCCAGGACCAGGGGCAGCTCAGGTCCCCAGCCAGCCAAGCCCTG 
CAGTGGCGCCACCCCAACGCCTCTCTTGTTGGTTGGAGACAGGAGCCCGGTGCCTTCCCCTGGAAGCTCGTCCCC 
ACAGCTGCAGGTAAAGTCCTCCTGCAAGGAGAATCCTXTTAACCGGAAGCCATCACCTGCAGCGTCCCCAGCCAC 
AAAGAAGGCCACCAAGGGATCCAAGCCAGTGAGGCCACCTGCCCCTGGACACGGCTTTCCACTCATCAAACGCAA 
GGTCCAGGCTGACCAGTACATCCCTGAGGAGGACATCCATGGAGAGATGGATACCATTGAGCGCCGGCTGGATGC 
CCTGGAGCACCGXGGGGTGCTGCTGGAGGAGAAGCTGCGTGGCGGCCTGAATGAGGGCCGTGAGGATGACATGCT 
GGTGGACTGGTTCAAGCTCATCCACGAGAAGCACCTACTGGTGCGGCGAGAGTCCGAGCTCATCTATGTCTTCAA 
GCAGCAGAACCTGGAGCAGCGCCAGGCTGATGTCGAGTATGAGCTCCGGTGCCTCCTCAATAAGCCAGAAAAGGA 
CTGGACGGAGGAGGACCGGGCCCGGGAGAAGGTGCTGATGCAGGAGCTTGTGACCCTCATTGAGCAGCGCAACGC 
TATCATCAACTGCCTGGATGAGGACCGGCAGAGGGAGGAAGAGGAAGACAAGATGTTGGAAGCCATGATCAAGAA 
GAAAGAGTTCCAGAGGGAGGCTGAACCTGAGGGCAAGAAGAAGGGGAAGTTCAAGACCATGAAGATGTTGAAACT 
GCTAGGAAACAAACGTGATGCCAAGAGCAAGTCCCCCAGAGACAAGAGCTAACAGCACGAGAAGCCAGTTGGGGA 
CTGCCCCCTCCXGGAGCAGCTCCTGGGCTGTGCTCTGTTTGAAGGGGGCGCCCTGCTCCCCTCAGATCAGTCAGG 
AGGAAGATGACTAAGGGGAGGGATCCTCTGGGTGATGGCCTCTTCCTCCTCAGGGACCTCTGACTGCTCTGGGCC 
AAAGAATCTCTTGTTTCTTCTCCGAGCCCCAGGCAGCGGTGATTCAGCCCXGCCCAACCTGATTCXGATGACTGC 
GGAXGCXGXGACGGACCCAAGGGGCAAAXAGGGXCCCAGGGTCCAGGGAGGGGCGCCTGCXGAGCACTTCCGCCC 
CXCACCCTGCCCAGCCCCTGCCAXGAGCXCTGGGCXGGGTCTCCGCCTCCAGGGXTCTGCTCXTCCAGGCAGGCC 
AGCAAGTGGCGCTGGGCCACACXGGCTXCTTCCXGCCCCAXCCCXGGCTCTGAGTCTCXGXCXTCCTGTCCTGXG 
CAGGCGCCCXXGGATCXCAGXXXCCCXCACTCAGGAACTCTGXTXCTGAAGTCTTCAGTTAAGTXXGAGTTTATG 
ACXGAGTGGCCXGXACTGTCAGACGXGAAXGGGCCXGACGGGCAAATCCATCCCXCTCXCCCTCACAGXTCCAGG 
AGCGGCTXCCCXCGTCXCCCCTTACXCCACAGGGAGCCTCCCTTGCCAGGACCAGGGCXGCGACGGCCAXGCTGG 
GGCAGGXGAGXGCXCXGXXAGCXGCTCCCAGXGCTGXCCCCAGGCXGCAGTXCTGGTCCCXGGXXGTCAGGTAGG 
AAGGGXGCACXXGAAGCAGGXGCTCATCTCGGTXCCXXAACGXXXATAGTCXGACCCCXCACTXAGGCXTTCCXC 
XGCCACCCCGGXCCAGGGAAGAGGCTCGCXCCCGCCCATGGTCATCACXGGTCTGXCTGCTCTGTTGXCTGTTCX 
TXCCCXGACTCCCXCCCACCGAAGGCCTGAXGGCXACXCACCCCXCTGGGAXGGCXATGGGAGAGGAGGAGXGAX 
GGGGACCGCCACCTXXTCTGCAGGAAAXGXGCCCAGCAGCXCTXGGTCAAAGCACTGXTGCXAXAAGCXATCTCT 
GGGATGCCXCTAGGCCCCCXXCCCXCTACACACCTCXGGGAAAAGAXTACACXGTAXXAACXCXCGAGGAGTXTC 
CXCACCAAXAAACAGACAACCXCAACXGCCAGXGCCCTGCAGCCXCGGGCCACAGCGGCAGCCXXGXXXGCCXTC 
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FIGURE 65B 

CCACCTGCCTCTGCCACACCTGGTGGCTGAACATCTCTGGTCGCCCAGAGGCCATGTTGGGGCCATCCTCCAAGA 
GGGATCTCTGCCCTCACCGCCTGCCACTGGGCAGGATCCCTTTCCTCTGCAGGGAGAGGTGGCTCCTCGGCCATG 
CAGCCCCTGGCAGGCTCCTTCTAAACATGCCTGTTGACCTGGAGCTGGCGCCACCAACTCCAGGGCCTTTCCAGG 
GCCAGACAGGTAACACGCATGAACCCGAGTGACAGCTCTGACGGGCTGXTTCGGTGTCAGGAGACAAAGCTGGCA 
GGGGCAGGGGTGAACTGGAGGCAAGTCAAGTCACCTGTGGCCTGTGGGGCTGAATGTGGGCCCGGTGTTGCCAGA 
TCCTTTGTCATAAGAAGCTAGAAATCCAGATTTTATGTGTGTGTAATTTGTAAATGCTGAAAGCTAGCCTGAATT 
TTTTTTTTTTTTTTGAGACAGAGTCTCGCXCTGTCGCCCAGGCTGGAGTGCAGTGGCGCGATCTCAGCTCACTGC 
AGGCTCCGCCTCCTGGGTTCACGCCATCCTCCTGCCTCGGCCTCCTGAGCAGCTGGGACTACAGGCGCATGCTAC 
GACGCCTGGCTAATTTTTTGTATTTTTAGTAGAGACGGGGTTTCACCGTGTTAACCAGGATGGTCTCGATCTCCT 
GACCTTGTGATCCACCCACCTTGGCCTCCCAAAGTGCXGGGATTACAGGCGTGAGCCACCACGCCCGGCCACTAG 
CCTGAATTTCAATCAAGGGTTGGCTGATACTGTGTGTCCAGGGTGGACTGGATTTGTCCTGGGGGGTTCTCTGGT 
TTGCTGCCTCCTGACCACATGATGGGGCCTTCGAGGTCGAGGACAACTGTTCCCATTAGATTGCACCCTCTGCCC 
TCAGGTTCTTGAGGGTGTGTGGACACAGAGGCTTTCCATGGGATGTCCCTGAGCCGGCCCTTGATTGGGGCCTCA 
CCATTXACAGGGCCGTTTTATTCTGCAAACCGAAACTTGGGTCATGTGACCTGATGGGATTATGGGACTCCCTCC 
AGGXGCCCGAGACAAGGXXGAXAXTXCCAAAAXAXXXXGGXGAXXTAGXGGGACAAGCAAATGACAGAATACCGG 
AGAAGGCAGGGATCGTGGGTGXCAGGAGCCAGAGGGGAGGGGGACAGAXGXGCTGXGXACAGGACAAGGXGXCAG 
GTGACXCCTXCCCAGCAGGGCCTCGCAGATGCACAAGCACGGAGCXGGTGGGXXXXGCCCAAGAAAGGXCACGCG 
GCACAXGCAGGGAXXGGAACXCCCAGGCCAGGGCXCTAGGXCGCTCCCACCXXXXCAXGXXXCXXXCXGXGGCCA 
TGGGTATAGXGGAAAGACAXAAAGCTAAAGCCAACXTTTAATCCTGAAXGCACTGCXXGCCAGGXAAAXGCCCTX 
GGXTGTGGXAXCTXGTXGAGACTTAGTTXXCACAGAGGGAXAAXGAACCGXXGCAGAGGXXTATXGAGATCAXTA 
ACAGAGXGGAAXXCAGCACCXGCCACTGCACXCCAGCCXGGGCGACAGAGCAAGACXCAGXCXCAAAAAAAAAAA 
AAATTAGCXGGGCAGGGCAXGGTGAXGGXGCCTGXAAXCCXAGCXACTTGGGAGGCXGAGGCATGAGAAXTGCCX 
GAACCCAGGAGGXGGAGGXTGCAGXGAGCCGAGAXCGXGCCACXGXACXCCAGCCXGGGTGACAGCGCGAGACXC 
CGXCXCAAAAAAAGCXGGGXGXGGGGAACACCXGXGGXCCCAGCTAXXCXGGAGACXGAGGCAGGAGGATXGCXX 
GAGCXCAGGAGXXCTGGCTGCAGTGAGCTATGAXCATGCCACXGXAXXACAGAATGGGTGACAGAATGAGAGCGA 
CACXGXCXCAAAAAAAAAAAAAAAAAAAGGCCGGGAGCGGXCGXXXGTGCCXGTAATCCCAACACXXTGGGAGGC 
CAGGGTGGGCGGATCACXXGAGGCCAGGAGXTCAAGACCAGCCXGGCCAACAXGGXGAAXCCCCAXCXCXACXAA 
AAAAAXTAACXGGACATGGTGGXGGACACXTGTAATCCCAGCTACTCAGGAGGCXGACACATGAGAAXTGCTXGA 
ACCCGGGAGGCGGAGGXXACAGXGAGCCGAGAXAGCACCACXGCACXCCAACCTGGGCACAGAGXAAGGCXCXGT 
CXXX 
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FIGURE 66 

ELGIPALLDPNDMVSMSVPDCLSIMTYVSQYYNHFCSPGQAGVSPPRKGLAPCSPPSVAPTPVESEDVAQGEELS 
SGSLSEQGTGQTPSSTCAACQQHVHLVQRYLADGRLYHRHCFRCRRCSSTLLPGAYENGPEEGTFVCAEHCARLG 
PGTRSGTRPGPFSQPKQQHQQQLAEDAKDVPGGGPSSSAPAGAEADGPKASPEARPQIPTKPRVPGKLQELASPP 
AGRPTPAPRKASESTTPAPPTPRPRSSLQQENLVEQAGSSSLVNGRLHELPVPKPRGTPKPSEGTPAPRKDPPWI 
TLVQAEPKKKPAPLPPSSSPGPPSQDSRQVENGGTEEVAQPSPTASLESKPYNPFEEEEEDKEEEAPAAPSLATS 
PALGHPESTPKSLHPWYGITPTSSPKTKKRPAPRAPSASPLALHASRLSHSEPPSATPSPALSVESLSSESASQT 
AGAELLEPPAVPKSSSEPAVHAPGTPGNPVSLSTNSSLASSGELVEPRVEQMPQASPGLAPRTRGSSGPQPAKPC 
SGATPTPLLLVGDRSPVPSPGSSSPQLQVKSSCKENPFNRKPSPAASPATKKATKGSKPVRPPAPGHGFPLIKRK 
VQADQYIPEEDIHGEMDTIERRLDALEHRGVLLEEKLRGGLNEGREDDMLVDWFKLIHEKHLLVRRESELIYVFK 
QQNLEQRQADVEYELRCLLNKPEKDWTEEDRAREKVLMQELVTLIEQRNAIINCLDEDRQREEEEDKMLEAMIKK 
KEFQREAEPEGKKKGKFKTMKMLKLLGNKRDAKSKSPRDKS 
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FIGURE 67 

CTTTGTTTTGCTTCGAG ATGG CT6CGGGGATGTATTTGGAACATTATCTGGACAGTATTGAAAACCTTCCCTTTG 
AATTACAGAGAAACTTTCAGCTCATGAGGGACCTAGACCAAAGAACAGAGGACCTGAAGGCTGAAATTGACAAGT 
TGGCCACTGAGTATATGAGTAGTGCCCGCAGCCTGAGCTCCGAGGAAAAATTGGCCCTTCTCAAACAGATCCAGG 
AAGCCXATGGCAAGTGCAAGGAATTTGGTGACGACAAGGTGCAGCTTGCCATGCAGACCTATGAGATGGTGGACA 
AACACATTCGGCGGCTGGACACAGACCTGGCCCGTTTTGAGGCTGATCTCAAGGAGAAACAGATTGAGTCAAGTG 
ACTATGACAGCTCTTCCAGCAAAGGCAAAAAGAAAGGCCGGACTCAAAAGGAGAAGAAAGCTGCTCGTGCTCGTT 
CCAAAGGGAAAAACTCGGATGAAGAAGCCCCCAAGACTGCCCAGAAGAAGTTAAAGCTCGTGCGCACAAGTCCTG 
AGTATGGGATGCCCTCAGTGACCTTTGGCAGTGTCCACCCCTCTGATGTGTTGGATATGCCTGTGGATCCCAACG 
AACCCACCTATTGCCTTTGTCACCAGGTCTCCTATGGAGAGATGATTGGCTGTGACAACCCTGATTGTTCCATTG 
AGTGGTTCCATTTTGCCTGTGTGGGGCTGACAACCAAGCCTCGGGGGAAATGGTTTTGCCCACGCTGCTCCCAAG 
AACGGAAGAAGAA ATAGA TAAGGGCCTTGGATTCCAACACAGTTTCTTCCACATCCCCTGACTTGGGCTAGTGGG 
CAGAGGAATGCCTGTGCTGGGGCCAGGGGTTCAGGGAGGAGTGGATGGCACAGTGCTGTCATCCCTTCTCCTCCC 
CXCTCCCCACTCCCGGTGCTGAGGCTGCATCAGACCCTGGTAGGGAGGGGTGCCGCAGCCACTAACGGTATGTGC 
TCTCCTTCAGCCCTCTCCTTCGGAGGGACGTGGTCTTGCCCACTGTCCTTTTGCCTCCATGCTGAGGTCGGTGCT 
GTATTTCAGAGGGAGGGTCCTTTTCATTCTCCTTGCTTTGTATTTAAGGACTGGGGCATAGCATGGGGGCAGTCC 
CCCAGACCTCTTCATTCCCCCTCCTGTGGTGAGGGCTAGGTGTGATCAACACTTTTCTTCTCCATTCCCTXCCXG 
CTTTTTTCATGGTGGGGATCCACCAGGTCATCTAGCTCTGGCCCTAGTTGAAGGGCACCCCTTCCTCTGTGCCAA 
GAGGATTCATCCTGGGAGAGGGGGCAAGGTGGAATGCAGATAACTCACATGTAAAAGGAACTTGGGTAGGTAAAT 
AAAAGCTATACATGTTGAAAAAAAAAA 
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FIGURE 68 

MAAGMYLEHYLDSIENLPFELQRNFQLMRDLDQRTEDLKAEIDKLATEYMSSARSLSSEEKLALLKQIQEAYGKC 
KEFGDDKVQLAMQTYEMVDKHIRRLDTDLARFEADLKEKQIESSDYDSSSSKGKKKGRTQKEKKAARARSKGKNS 
DEEAPKTAQKKLKLVRTSPEYGMPSVTFGSVHPSDVLDMPVDPNEPTYCLCHQVSYGEMIGCDNPDCSIEWFHFA 
CVGLTTKPRGKWFCPRCSQERKKK 
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FIGURE 69 

TACGTGAAGCACCGACACAAACTGGAGAATGGTCTGGCTGCGCTCAGTCCCTTAAGCAAGGGCTCCMG^ 

GGCCCTTACCTGCCCCGAGCCCTGCAGCAGCCTCTGGAACAGCTGACTCGGTATGGGCGGCTCCXGGAGGAGCTC 

CTGAGGGAAGCTGGGCCTGAGCTCAGTTCTGAGTGCCGGGCCCTTGGGGCTGCTGTACAGCTGCTCCGGGAACAA 

GAGGCCCGTGGCAGAGACCTGCTGGCCGTGGAGGCGGTGCGTGGCTGTGAGATAGATCTGAAGGAGCAGGGACAG 

CTCTTGCATCGAGACCCCTTCACTGTCATCTGTGGCCGAAAGAAGTGCCTTCGCCATGTCTTTCTCTTCGAGCAX 

CTCCTCCTGTTCAGCAAGCTCAAGGGCCCTGAAGGGGGGTCAGAGATGTTTGTTTACAAGCAGGCCTTTAAGACT 

GCTGATATGGGGCTGACAGAAAACATCGGGGACAGCGGACTCTGCTTTGAGTTGTGGTTTCGGCGGCGGCGTGCA 

CGAGAGGCATACACTCTGCAGGCAACCTCACCAGAGATCAAACTCAAGTGGACAAGTTCTATTGCCCAGCTGCTG 

TGGAGACAGGCAGCCCACAACAAGGAGCTCCGAGTGCAGCAGATGGTGTCCATGGGCATTGGGAATAAACCCTTC 

CTGGACATCAAAGCCCTTGGGGAGCGGACGCTGAGTGCCCTGCTCACTGGAAGAGCCCCAGAAACACTTGACTCT 

TCTGGAGATGTGTCCCCAGGACCAAGAAACAGCCCCAGCCTGCAACCCCCCCACCCTGGGAGCAGCACTCCCACC 

CTGGCCAGTCGAGGGATCTTAGGGCTATCCCGACAGAGTCATGCTCGAGCCCTGAGTGACCCCACCACGCCTCTG 

TGA CCTGGAGAAGATCCAGAACTTGCGTGCAGCTTCTCCTCTCAGCACACTTTGGGCTGGGATGGCAGTGGGGCA 

TAATGGAGCCCTGGGCGATCGCTGAATTTCTTCCCTCTGCTTCCTGGACACAGAGGAGGTCTAACGACCAGAGTA 

TTGCCCTGCCACCACTATCTCTAGTCTCCCTAGCTTGGTGCCTTCTCCTGCAGGAGTCAGAGCAGCCACATTGCT 

TGCCTTCATACCCTGGAGGTGGGGAAGTTATCCCTCTTCCGGTGCTTXCCCATCCTGGGCCACTGTATCCAGGAC 

ATCACTCCCATGCCAGCCCTCCCTGGCAGCCCATGTTCTCCTCTTTTCTCACCCCCTGACTTTCCCTGAGAAGAA 

TCATCTCTGCCAGGTCAACTGGAGTCCCTGGTGACTCCATTCTGAGGTGTCACAAGCAATGAAGCTATGCAAACA 

ATAGGAGGGTGTGACAGGGGAACCGTAGACTTTATATATGTAATXACTGTTATTATAATACTATTGTTATATTAA 

ATGTATTTACTCACACTTTGCCTCT 
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FIGURE 70 

MEAGPYLPRALQQPLEQLTRYGRLLEELLREAGPELSSECRALGAAVQLLREQEARGRDLLAVEAVRGCEIDLKE 
QGQLLHRDPFTVICGRKKCLRHVFLFEHLLLFSKLKGPEGGSEMFVYKQAFKTADMGLTENIGDSGLCFELWFRR 
RRAREAYTLQATSPEIKLKWTSSIAQLLWRQAAHNKELRVQQMVSMGIGNKPFLDIKALGERTLSALLTGRAPET 
LDSSGDVSPGPRNSPSLQPPHPGSSTPTLASRGILGLSRQSHARALSDPTTPL 
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FIGURE 71 

CTTTCTGCCCGACTCCCACAGCTACACCATGCAGGAATTCGCCCGGCGTTACTTCCGGAGGTCCCAGGCCTTGCT 
GGGCCAGACTGATGGAGGTGCCGCAGGAAAGGACACGGACAGCCTGGTGCAGTACACCAAGGCTCCCATCCAGGA 
GTCGCTCCTCAGCCTCAGTGATGATGTGAGCAAGCTGGCTGTAGCCAGCTTCCTGGCCCTGATGCGGTTTATGGG 
TGACCAGXCCAAGCCCCGGGGCAAGGATGAGATGGATCTGCTCTATGAACTGCTGAAGCTGTGCCAGCAGGAGAA 
GCTGAGGGATGAGATTTACTGCCAGGTTATCAAGCAGGTCACGGGACACCCCCGGCCGGAACACTGCACTCGAGG 
CTGGAGCTTCCTCAGCCTTCTCACAGGCTTCTTCCCCCCGTCGACCAGGCTGATGCCCTACCTGACCAAGTTTCT 
GCAGGATTCAGGCCCCAGCCAAGAGCTGGCCCGGAGCAGCCAGGAGCACCTCCAGCGCACAGTCAAATATGGGGG 
GCGCCGGCGGATGCCCCCACCGGGTGAAATGAAGGCTTTCCTGAAAGGACAAGCGATTCGCCTGCTTCTTATTCA 
CCTGCCGGGGGGTGTGGATTATAGGACGAATATCCAGACTTTCACAGTAGCAGCAGAAGTGCAGGAGGAGCTGTG 
CCGGCAAATGGGTATCACGGAGCCTCAGGAAGTGCAGGAATTCGCCCTCTTCCTCATCAAAGAGAAGAGCCAGCT 
GGTGCGGCCCCTGCAGCCCGCCGAATGCCTCAACAGCGTGGTAGTGGACCAGGACGTGAGCCTGCACAGCCGGCG 
GCTCCACTGGGAGACCCCACTGCACTTCGATAACTCCACCTACATCAGCACCCACTACAGCCAGGTGCTGTGGGA 
CTACCTXCAGGGGAAGCTGCCAGTCAGCGCCAAGGCAGACGCGCAGCTCGCCAGGCTGGCCGCCCTGCAGCACCT 
CAGCAAGGCCAACAGGAATACCCCCXCAGGGCAGGACCTGCTAGCTTACGTGCCAAAGCAGCTGCAACGGCAGGT 
GAACACGGCCTCCATCAAGAACCTGATGGGTCAGGAGCTGAGACGGCTGGAAGGACACAGCCCCCAGGAAGCACA 
GATCAGCTTCATCGAGGCCATGAGCCAGCTGCCCCTCTTCGGCTACACCGTCTATGGGGTGCTGCGAGTGAGCAT 
GCAGGCCCTGTCCGGACCCACTCTCCTGGGGCTCAACCGCCAGCATCTCATCCTCATGGACCCCAGCTCCCAGAG 
CCTGTACTGCCGCATTGCCCTGAAGAGCCTGCAGCGGCTCCACCTGCTAAGCCCTCTGGAGGAGAAGGGGCCCCC 
TGGCCTGGAAGTCAACTATGGCTCAGCTGACAACCCCCAGACCATCTGGTTTGAGCTGCCACAGGCCCAGGAGCT 
GCTATACACCACTGTCTTCCTGATAGACAGCAGTGCCTCTTGCACTGAGTGGCCCAGCATCAACTGAGAGGAGTG 
CAGGCCGGGGAGAGAAGAGGATGAGGCCTCCCCCGGCCCAAGTCTCACCCACATGGTCTGCCTTGGATGCTATCA 
GATCACTGTTCTAGAACCTGCCTCAGCACAGCCCAGCCGGCCCACATGCAGGCCATGAGGCAGGGGCTGCTATCA 
CGTCACCAGCAGGCAAAGAAAACAGCCAGACCCTCTCCAGGACGGCCTGGGGCCAAAGCGGGCTGCAGGAACTCG 
GCTGGGGCACCTGAGGTTGCCCAGTCTGAGGGAGATGCCCACCCGACCCCAGGCTCCGCCCAGGCCCCACATXAG 
CACAAGCCCAGGCATGGGAGAAACAGCTGCTGAGGAAATAAAACTCCCTGGAGAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAA 
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FIGURE 72 

MQEFARRYFRRSQALLGQTDGGAAGKDTDSLVQYTKAPIQESLLSLSDDVSKLAVASFLALMRFMGDQSKPRGKD 
EMDLLYELLKLCQQEKLRDEIYCQVIKQVTGHPRPEHCTRGWSFLSLLTGFFPPSTRLMPYLTKFLQDSGPSQEL 

ARSSQEHLQRTVKYGGRRRMPPPGEMKAFLKGQAIRLLLIHLPGGVDYRTNIQTFTVAAEVQEELCRQMGITEPQ 
EVQEFALFLIKEKSQLVRPLQPAECLNSVWDQDVSLHSRRLHWEXPLHFDNSTYISTHYSQVLWDYLQGKLPVS 
AKADAQLARLAALQHLSKANRNTPSGQDLLAYVPKQLQRQVNTASIKNLMGQELRRLEGHSPQEAQISFIEAMSQ 
LPLFGYTVYGVLRVSMQALSGPTLLGLNRQHLILMDPSSQSLYCRIALKSLQRLHLLSPLEEKGPPGLEVNYGSA 
DNPQTIWFELPQAQELLYXTVFIilDSSASCTEWPSIN 
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FIGURE 73 

CAGGTCCCCCAGACCATCCGAACCAGCCGCCCATGATGACCTGCGAGAGACTCCCATGCCCCACTGCAGGCCTGG 

GCCCCCTCAGGGAGGAAGCGATGAAGCCAGGGGCAGCCTCCAGTCCCTTGCAGCAGGTCCCCGCCCCGCCACTGC 

CTGCGAAGAAGAACCTTCCCACTGCCCCTCCCAGACGCCGCGTTTCCGAGAGGGTGTCCTTAGAAGACC7VAAGTC 

CGGGGATGGCGGCAGAGGGGGACCAGCTCAGCCTGCCTCCCCAAGGGACCTCAGACGGCCCTGAGGACACGCCCC 

GGGAGAGCACGGAGCAAGGCCAGGACACAGAGGTGAAAGCCAGCGATCCTCACAGCATGCCAGAGCTGCCCAGGA 

CAGCCAAACAACCCCCAGTCCCGCCCCCCAGGAAAAAACGGATCTCTCGACAACTGGCCTCGACCCTCCCAGCTC 

CCTTAGAGAACGCTGAGCTCTGCACACAGGC GATGG CCTTGGAGACACCCACGCCGGGTCCACCCAGAGAGGGCC 

AAAGCCCTGCTTCTCAGGCTGGGACTCAGCACCCTCCTGCCCAGGCCACTGCCCATTCCCAGAGCTCTCCAGAGT 

TCAAGGGCTCCCTGGCCTCCCTCTCAGACAGCTTGGGGGTGTCTGTCATGGCCACCGACCAGGACTCCTACTCCA 

CCAGCAGCACGGAGGAGGAGCTGGAGCAGTTCAGCAGCCCCAGCGXGAAGAAGAAGCCCTCCATGATCCTGGGCA 

AGGCTCGGCACCGGCTGAGCTTTGCCAGTTTCAGCAGCATGTTCCACGCTTTCCTCTCCAACAACCGCAAGCTGT 

ACAAGAAGGTGGTGGAGCTGGCGCAGGACAAGGGCTCGTACTTTGGCAGCCTGGTGCAGGACTACAAGGTGTACA 

GCCTGGAGATGATGGCGCGCCAGACCTCCAGCACGGAGATGCTGCAGGAGATTCGCACCATGATGACCCAGCTCA 

AGAGCTACCTGCTGCAGAGCACCGAGCTCAAGGCCCTGGTGGACCCCGCCCTGCACTCCGAGGAGGAGCTCGAAG 

CAATTGTAGAGTCTGCCTTGTACAAATGTGTCCTGAAGCCCCTGAAGGAAGCCATCAACTCATGCCTGCATCAGA 

TCCACAGCAAGGATGGTTCGCTGCAGCAGCTCAAGGAGAACCAGTTAGTGATCCTGGCCACCACCACCACTGACC 

TAGGTGTGACCACCAGCGTGCCGGAGGTGCCCATGATGGAGAAGTTCCTGCAGAAGTTCACCAGCATGCACAAGG 

CCTACTCACCTGAGAAGAAGATCTCCATCCTGCTCAAGACCTGCAAACTCATCTACGACTCCATGGCCCTCGGCA 

ACCCAGGGAAGCCCTATGGGGCGGATGACTTCCTGCCXGTGCTCATGTATGTGCTGGCCCGCAGCAACCTCACGG 

AGATGCTTCTCAATGTGGAGTACATGATGGAGCTCATGGACCCCGCCCTGCAGCTGGGGGAGGGTTCCTACTATC 

TGACCACCACCTACGGGGCCCTGGAGCACATCAAGAGCTACGACAAGATCACGGTGACCCGGCAGCTGAGTGTGG 

AGGTGCAGGACTCCATCCACCGCTGGGAGCGCCGGCGTACTCTCAACAAGGCCCGGGCCTCCCGCTCCTCCGTAC 

AGGACTTCATCTGCGTGTCGTACCTGGAGCCCGAGCAGCAGGCGCGGACGCTGGCGTCGCGGGCGGACACCCAGG 

CCCAGGCGCTGTGCGCGCAGTGCGCGGAGAAGTTCGCGGTGGAGCGGCCGCAGGCGCACCGGCTGTTCGTGCTGG 

TGGACGGGCGCTGCTTCCAGCTGGCGGACGACGCGCTGCCGCACTGCATCAAGGGCTACCTGCTGCGCAGCGAGC 

CCAAGCGCGACTTCCACTTTGTCTACCGGCCCCTGGACGGTGGT'GGCGGCGGCGGCGGGAGCCCGCCCTGCCTGG 

TGGTGCGGGAGCCCAACTTCCTGTGAGGCCCTCCCGGGGCGCCTCCCCTCACCCCCAGGCGCACGTCTGGCCCCG 

CCTCTGGCTGCGCACTCCCGACCGCGACGTCCACGCAGCAGAGGGACATGGGCCATTCCATGACGTGCCCAGGCC 

AACGTCGCAGGACAGTTGTGAAAATAACAXGACGCTCGTCCAAGGCCACTTCCTGAGGGCAAGTCCTAATAGCCC 

XGAGACACCGAGACGGCATGTTCTTCATTAGACGGAAAGGGAAACTGAGGCTCAGGAGAGAGGCGCTCCAGGCCG 

CTGCAGCCAACAAACCGGCGCCCTCXTCACACGTAGCXCCTCAGGCCAXXCCCCATGAGTCCCCCACACCCACCC 

CAXCCTCGGXCXXXGCAAAGAAGGGCCCGAGCXXAGTTXCCCCAGGACXGGCCTAGGAAGGAGCACCGGCCACAG 

CXGCXCCCCGXGCCACTCAGGGXGGACCCAGCAXCTCAGGAGCACCXCAGGGTGXCGGXXAAGAGACAGGCCXCC 

ACCCCTGCACCAGAGCCXGCXCTTAGCTCCCGGXGCCCCCAGACCCCGCAGCXCTGTGCCCGGCAAGAAGCAGCT 

AGACACACGCCAGGGCGCCAAGCACGGCCAGCTCCGCGGACCCAXGGACAAGCCCAGCCCCGCCTCAGGGAAAAC 

AGGGCCXXTCCCXACAGGACACGCCCCAGAGCXGTGGCAGATXGCAGGAGCCATCGXGXGGGGAGAAXTTAAXAA 

AAGCCCXXTTGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAi^^ 

AAAAAAAAAAAAAAAAAA 
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FIGURE 74 

MALETPTPGPPREGQSPASQAGTQHPPAQATAHSQSSPEFKGSLASLSDSLGVSVMATDQDSYSTSSTEEELEQF 
SSPSVKKKPSMILGKARHRLSFASFSSMFHAFLSNNRKLYKKWELAQDKGSYFGSLVQDYKVYSLEMMARQTSS 
TEMLQEIRTMMTQLKSYLLQSTELKALVDPALHSEEELEAIVESALYKCVLKPLKEAINSCLHQIHSKDGSLQQL 
KENQLVILATTTTDLGVTTSVPEVPMMEKFLQKFTSMHKAYSPEKKISILLKTCKLIYDSMALGNPGKPYGADDF 
LPVLMYVLARSNLTEMLLNVEYMMELMDPALQLGEGSYYLTTTYGALEHIKSYDKITVTRQLSVEVQDSIHRWER 
RRTLNKARASRSSVQDFICVSYLEPEQQARTLASRADTQAQALCAQCAEKFAVERPQAHRLFVLVDGRCFQLADD 
ALPHCIKGYLLRSEPKRDFHFVYRPLDGGGGGGGSPPCLWREPNFL 
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FIGURE 75 

ACGAGGGACGCAGCCATGGCGGAGGCGGCTTTGGAAGCCGTGCGGAGCGAGTTACGAGAATTCCCGGCCGCTGCA 
AGGGAGCTCTGCGTGCCTCTTGCTGTGCCCTACCTGGACAAACCCCCAACTCCGCTCCACTTCTACCGGGACTGG 
GTCTGCCCCAACAGGCCGTGCATTATCCGCAACGCTCTGCAGCACTGGCCGGCCCTCCAGAAGTGGTCCCTCCCC 
TATTTCAGAGCCACAGTGGGCTCCACAGAGGTGAGTGTGGCCGTGACCCCAGATGGTTACGCGGATGCCGXGAGA 
GGGGATCGCTTCATGATGCCAGCTGAGCGCCGCCTGCCCCTGAGCTTCGTGCTGGATGTGCTGGAGGGCCGGGCC 
CAGCACCCTGGAGTCCTCTATGTGCAGAAGCAGTGCTCCAACCTGCCCAGCGAGCTGCCCCAGCTGCTGCCTGAT 
CTGGAATCCCATGTGCCCTGGGCCTCCGAAGCCCTGGGAAAGATGCCCGATGCTGTGAACTTCTGGCTGGGGGAG 
GCGGCTGCAGTGACTTCTTTGCACAAGGACCACTATGAGAACCTCTACTGCGTGGTCTCAGGAGAGAAGCATTTC 
CTGTTCCATCCGCCCAGCGACCGGCCCTTCATCCCCTATGAGCTGTACACGCCGGCAACCTACCAGCTAACTGAA 
GAGGGCACCTTTAAGGTGGTGGATGAAGAGGCCATGGAGAAGGCAGAGGTGTCCAGGACCTGCCTGCTCACGGTT 
CGTGTCCTGCAGGCCCATCGCCTACCCTCTAAGGACCTAGTGACCCCCTCTGACTGCTACGTGACTCTCTGGCTG 
CCCACGGCCTGCAGCCACAGGCTCCAGACACGCACGGTCAAGAACAGCAGTAGCCCTGTCTGGAACCAGAGCTTT 
CACTTCAGGATCCACAGGCAGCTCAAGAATGTCATGGAACTGAAAGTCTTTGACCAGGACCTGGTGACCGGAGAT 
GACCCTGTGTTGTCAGTACTGTTTGATGCGGGGACTCTGCGGGCTGGGGAGTTCCGGCGCGAGAGCTTCTCACTG 
AGCCCTCAGGGTGAGGGGCGCCTGGAAGTTGAATTTCGCCTGCAGAGTCTGGCTGACCGTGGCGAGTGGCTCGTC 
AGCAATGGCGTTCTGGTGGCCCGGGAGCTCTCCTGCTTGCACGTTCAACTGGAGGAGACAGGAGACCAGAAGTCC 
TCAGAGCACAGAGTTCAGCTTGTGGTTCCTGGGTCCTGTGAGGGTCCGCAGGAGGCCTCTGTGGGCACTGGCACC 
TTCCGCTTCCACTGCCCAGCCTGCTGGGAGCAGGAGCTGAGTATTCGCCTGCAGGATGCCCCCGAGGAGCAACTA 
AAGGCGCCACTGAGTGCCCTGCCCTCTGGTCAAGTGGTGAGGCTTGTCTTCCCCACGTCCCAGGAGCCCCTGATG 
AGAGTGGAGCTGAAAAAAGAAGCAGGACTGAGGGAGCTGGCCGTGCGACTGGGCTTCGGGCCCTGTGCAGAGGAG 
CAGGCCTTCCTGAGCAGGAGGAAGCAGGTGGTGGCCGCGGCCTTGAGGCAGGCCCTGCAGCTGGATGGAGACCTG 
CAGGAGGATGAGATCCCAGTGGTAGCTATTATGGCCACTGGTGGTGGGATCCGGGCAATGACTTCCCTGTATGGG 
CAGCTGGCTGGCCTGAAGGAGCTGGGCCTCTTGGATTGCGTCTCCTACATCACCGGGGCCTCGGGCTCCACCTGG 
GCCTTGGCCAACCTTTATGAGGACCCAGAGTGGTCTCAGAAGGACCTGGCAGGGCCCACTGAGTTGCTGAAGACC 
CAGGTGACCAAGAACAAGCTGGGTGTGCTGGCCCCCAGCCAGCTGCAGCGGTACCGGCAGGAGCTGGCCGAGCGT 
GCCCGCXTGGGCTACCCAAGCTGCTTCACCAACCTGTGGGCCCTCATCAACGAGGCGCTGCTGCATGATGAGCCC 
CATGATCACAAGCTCTCAGATCAACGGGAGGCCCTGAGTCATGGCCAGAACCCTCTGCCCATCTACTGTGCCCTC 
AACACCAAAGGGCAGAGCCTGACCACTTTTGAATTTGGGGAGTGGTGCGAGTTCTCTCCCTACGAGGTCGGCTTC 
CCCAAGTACGGGGCCXTCATCCCCTCTGAGCTCTTTGGCTCCGAGTTCTTTATGGGGCAGCTGATGAAGAGGCTX 
CCTGAGTCCCGCATCTGCTTCTTAGAAGGTATCTGGAGCAACCTGTATGCAGCCAACCTCCAGGACAGCTTATAC 
TGGGCCTCAGAGCCCAGCCAGTTCTGGGACCGCTGGGTCAGGAACCAGGCCAACCTGGACAAGGAGCAGGTCCCC 
CTTCTGAAGATAGAAGAACCACCCTCAACAGCCGGCAGRATAGCTGAGTTTTTCACCGATCTTCTGACGTGGCGT 
CCACTGGCCCAGGCCACACATAATTTCCTGCGTGGCCTCCATTTCCACAAAGACTACTTTCAGCATCCTCACTTC 
TCCACATGGAAAGCTACCACTCTGGATGGGCTCCCCAACCAGCTGACACCCTCGGAGCCCCACCTGTGCCTGCTG 
GATGTTGGCTACCTCATCAATACCAGCTGCCTGCCCCTCCTGCAGCCCACTCGGGACGTGGACCTCATCCTGTCA 
TTGGACTACAACCTCCACGGAGCCTTCCAGCAGTTGCAGCTCCTGGGCCGGTTCTGCCAGGAGCAGGGGATCCCG 
TTCCCACCCATCTCGCCCAGCCCCGAAGAGCAGCTCCAGCCTCGGGAGTGCCACACCTTCTCCGACCCCACCTGC 
CCCGGAGCCCCTGCGGXGCTGCACTTTCCTCTGGTCAGCGACTCCTTCCGGGAGTACTCGGCCCCTGGGGTCCGG 
CGGACACCCGAGGAGGCGGCAGCXGGGGAGGXGAACCXGXCXTCAXCGGACXCTCCCXACCACTACACGAAGGXG 
ACCXACAGCCAGGAGGACGXGGACAAGCXGCXGCACCXGACACAXXACAAXGXCXGCAACAACCAGGAGCAGCTG 
CXGGAGGCXCXGCGCCAGGCAGXGCAGCGGAGGCGGCAGCGCAGGCCCCACTGATGGCCGGGGCCCCXGCCACCC 
CXAACXCXCAXXCAXXCCCXGGCXGCTGAGXXGCAGGXGGGAACXGXCAXCACGCAGXGCXXCAGAGCCXCGGGC 
XCAGGTGGCACKGXCCCAGGGXCCAGGCXGAGGGCXGGGAGCXCCCXTGCGCCTCAGCAGXXXGCAGXGGGGXAA 
GGAGGCCAAGCCCAXXXGTGTAAXCACCCAAAACCCCCCGGCCXGXGCCXGXXTXCCCXXCXGCGCXACCXXGAG 
XAGXXGGAGCACXXGAXACAXCACAGACXCAXACAAAAAAAAAAAAAAAAAA 
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FIGURE 76 

MAEAALEAVRSELREFPAAARELCVPLAVPYLDKPPTPLHFYRDWVCPNRPCIIRNALQHWPALQKWSLPYFRAT 
VGSTEVSVAVTPDGYADAVRGDRFMMPAERRLPLSFVLDVLEGRAQHPGVLYVQKQCSNLPSELPQLLPDLESHV 
PWASEALGKMPDAVNFWLGEAAAVTSLHKDHYENLYCWSGEKHFLFHPPSDRPFIPYELYTPATYQLTEEGTFK 
WDEEAMEKAEVSRTCLLTVRVLQAHRLPSKDLVTPSDCYVTLWLPTACSHRLQTRTVKNSSSPVWNQSFHFRIH 
RQLKNVMELKVFDQDLVTGDDPVLSVLFDAGTLRAGEFRRESFSLSPQGEGRLEVEFRLQSLADRGEWLVSNGVL 
VARELSCLHVQLEETGDQKSSEHRVQLWPGSCEGPQEASVGTGTFRFHCPACWEQELSIRLQDAPEEQLKAPLS 
ALPSGQWRLVFPTSQEPLMRVELKKEAGLRELAVRLGFGPCAEEQAFLSRRKQWAAALRQALQLDGDLQEDEI 
PWAIMATGGGIRAMTSLYGQLAGLKELGLLDCVSYITGASGSTWALANLYEDPEWSQKDLAGPTELLKTQVTKN 
KLGVLAPSQLQRYRQELAERARLGYPSCFTNLWALINEALLHDEPHDHKLSDQREALSHGQNPLPIYCALNTKGQ 
SLTTFEFGEWCEFSPYEVGFPKYGAFIPSELFGSEFFMGQLMKRLPESRICFLEGIWSNLYAANLQDSLYWASEP 
SQFWDRWVRNQANLDKEQVPLLKIEEPPSTAGRIAEFFTDLLTWRPLAQATHNFLRGLHFHKDYFQHPHFSTWKA 
TXLDGLPNQLTPSEPHLCLLDVGYLINTSCLPLLQPTRDVDLILSLDYNLHGAFQQLQLLGRFCQEQGIPFPPIS 
PSPEEQLQPRECHTFSDPTCPGAPAVLHFPLVSDSFREYSAPGVRRTPEEAAAGEVNLSSSDSPYHYTKVXYSQE 
DVDKLLHLTHYNVCNNQEQLLEAIiRQAVQRRRQRRPH 
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FIGURE 77A 

CCAGACAGCCTTGTATGGGAAGATGGGAAGGGXGAGGTCGCCACATCCTTATGGGCACCGGAAGTTCATCACTAT 
GTCTGATGGAGCCACTTCTACATTCGACCTCTTCGAGCCCTTGGCTGAGCACTGTGTTGGAGATGATATCACCAT 
GGTCATCTGCCCTGGAATTGCCACACAATTCAAAAGTTGGATCTGAGTTTGGAGAAAGATATTTCCAACCTAAGT 
GGGTACTATTTTGAAACCAGATTTTTAATTTAATAGCCTATATTTGTAGTCTGTTGGATAGGTGTTTCCAAAGTG 
TGTCTTCTCAAGTGAAAACGCAACTCTAGGTTTCAAGTACTCCTTTTCTCCGATCCTGTGGTACTTGAATATCCA 
AAAACCCTGCACTTTGAACAATCAGCTGTTGCTATCTGGAACTAAACAGAACTATGAGTAAAATTGCCTGGATAC 
TTTAAAAAGATATTTTTCCTCCTTCATCTCCXTTGACTCCAGGACAGACTGGAATATAAGTAGTGGGTCTGCATG 
GATGTTTCAGGGATCAAAGGAGCCACCTGGGCGCCTGAGTGCCAACCCTCAGGGCCACAGGTGGGTGTGGTTTGG 
GCACGGGTCCAAGTGACTGTGACGGGACCCCTGGGCATGGGGCCAGGTCTGTAACCTGAAGAAGTTGTTTTCTGA 
CAATCACCAAATCATCCGAATGACATCAAAAGCAGCCCTTATCTCAGAGACCGAGATTTCTGTGGTCTCAACTTC 
GCTTTGGTATAATTTCTGGCACTCACCAGCCTCTATCATTATGACTTTCCCCCCAGTGTATTATTTCTCTAATAG 
GTTTCCTTTTCACGTTCTTTTAGCACAGACTGGCACTTTACCCTCTCAATTTGGAAGTTAGCCCCTCTCCTCTGT 
TACTXXTCCCTTCACCCAACTACAGCTGTGATCTAGAACATTCATAGTCATATTTCTGCTACTACTACCTTCATT 
TAXCAAGACXXXXXAXGAGAATAGGXAAACCAAGCAATAACTTCCXAGGACTGAATCACCACCCCAGAAGAGCGA 
GAGGCXCCXXTCATATGCCTGAGGCCACACCCCXXAACCTGTTCTGACAAAATAGTGGCTGGCCCATGTACCAGC 
TCCAXXCAGAAATXCAGGAAGAGAAAAGACAGCCCXGXXGXCACACAAACCGGXGTGGGGGAGGGTGGAGCCTGG 
TCTGCACGGCAGTCCXGGXGGCCCCXGTGGAGGACAGGCAGGGCXGGCAGCAXAGCCXXXGXTGCCACACAACCG 
GAATTXGCXCCCCCAGGACTGTGGGAGCCAGXGTCCCAGCTGAAATCTTTXXAGXGXGTGGCTCTGAATGGCACT 
CACATTCCATTTTGGCXCACATGAAACTAACTGAAGCCCTTTGXXCAAGCXXCAGGCTCXXAGGCATGGAAAXGA 
GAAXGTGACTGXGGCTGTCTTACAGGAAAATTCXXGXXXGXCCCXGAAXGAGAGCACAGAGGCATTGAAXXCACA 
GAGCTGCAAACTTGCCXGATAAATGAGGGAGXGGCAGXXTAXAGAXAGGXCATCXXXXTXCCXTCCXCCAGGTGX 
CCXTGCCTTXCTTCCCAAAGTCATTCAXTTCXGATGAGTAXATGAATCCCCCXCXTGCTAGTAAGGTTCTATTXG 
GGCXAAAACAAGGCTGAATTTTTAAAGAGXATTTGAAXAXATTTXAGAATCAAAXXGAGGCTATAAAXTGCAXCA 
AXCTGGACAATTCCAXXGCAGGAAXAATAXGTTAAAAACCAATGGGGAGAAGCACCCACAXCXCXCCXGTAGCAC 
TCCGXGTCXCATAAGCAATTTGAAGACACTXACAAGTAACTGAXTCCAGTCAAATTAGGAXTAACXGACTCAAAA 
AAXGGTGTCAAGXTTCTTTAATGXTTTTATGTXAGAAGXGAGXXXAACAGACTTGAAGAAAACXGXXATCTTXTC 
CTGCXGTGAGTTTACACAAATGATTCCAGAGCAGAAXGAAAGCAGAAAGCXGTXGGTTACAAXAXTCXTTTAACC 
.XCTCXGCAGCATTXXACACTTACTGGGAACCTTATGATTCACCGXAAGAGXGGAAATATACCXGAGTXCGTGTCC 
XAAXGGXCXCXAAXTCACAXXGGATCGXGGGCAAAXCACCXCACCTCXCXGAGCCXGTTCCCTCCXCXXAGACCA 
TCXCXAAGACCACTTCAXCXATTTACACAXCAXTTGCXTGAACATTGCTGAACAXCXGCGXGAACXXGGCCXCXC 
CAGCCCTTGCAGGTGGAAACAGCTGXGTCAAGGCTCAAGGCXCACGCTGAGGGGACXTGGAGGGAGGGGGCTTCX 
GCAXXAAGCXXXCCTGGXGAAGACCCXTGAXCXXGXCCAAAGCCCTGTGXCXXTGACTGGCXXCXCXXCAGAGXC 
CCCGTTGXCAXCGTAAGACCCXXGCXGXXXGGAGGGTGGXCXXGTGACTGXGGCAGCTGCTGGCCGCTGGAAXGA 
GGAGCCTATCTCCATCCTCCAGTGTGACTCAGGCAGAGCATTGAGAATXCCCAGGGCAGAAAXCCTTCCXGCTCA 
GGCXXXCATTCTAAAACTACAGXCXXCAXTAAAGCXGAACXTTCTGGGXAGCXGAGCXXATAXGCCCGGCAXCTG 
AAXGAGAGCXCXCXXXGTAACXGXGXGACXXGAGAXCXAGTXXGCCAGCXCCXGGGAAACAATACAXGXGTTCTX 
GXXXGXGTXXGCXCAGCAAGCAGAXGXCTGAGAXGTAAGAAGCXXXTCTXTTCCXGTGGCATTGATTCTGACTTA 
GAGCXGAAGTAAAGATCACXGAAACATCACGXCAAGXXGAAGXCACTCAXAGGXCXXXGTCCTXXAGGCAGGACA 
GGAGAGXCAXTAAGAAGCATTTCACXGXAGCATTCXAXCACAATAXCAXCXGGAAXXGTXXXCXXTGCCCAGAAA 
GCCXXAACXTGCCXCXAGAGAAXCCCTGGXATXACAACGATAXTGCGGCAXXAGAAXXCCAACXCXXCTGCTGTG 
GAAGTTXGAAGCGAAGCXGCAGCAAAACCAGAGAATTTCCXCAAGXGGCCTGTAGGCTCCXXGTTAXCTXATGCC 
CCCACCCCTCCCTCAACAAXATGAGTGAXCCAGAACTGGCCCAAACACCTCAGCTCXGGTCCCXXXXXGCCCXTC 
TXGGCCTXACXCXGTXGTXCAAAGCCACXXXGGAXTGCXTGGAXGCXTCGAACAGCCATGAAAAGXAGCCXGCCT 
GXGGCAXXTAGAGGCCAAGCAATXGACAGAAAGGGTTTCTTCTACCTCXGXXATCTAAGCAGAGGGAAGXAAACT 
XCXCACCGCCCCCCACCCCTCACXGCCCCCGATXACACTAGAATXGCXXXCGCCAAAXTGXAGXXGAAGCXAAGG 
AAGGGGAAXCXGGCCCCXGCTGGGAGAGGGAACXGGAAXGCCACACAAGGCAAGGCCTGCXXCCXXCCXXCCCCX 
CTGCXGCTGCXGCCXCGGAACGCTGCAGCCCAGGCXTCCTCCCACAGXGGCCCTTGGAAGCAGGCCGCAGAGTAG 
ACAGCXGCXCCXXXXGGAAGAGXCAGXCCCCXGTGXXTXCXGAACXGXXXXTCCXAGCAXGXAXGXGGGXAGAGC 
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FIGURE 77B 

TTTCATGCATCTCTAGTAATAATAAGCTGAAATTAGTTTTTTTTTTAATTCTCCAATTTAAAACTTTTAATTAAA 
AAGTAAATTTTAATGTCGAAAATGCAAACTTGGGGAGGGCAGAAAGATCACACACAAGGCTGTCACTTCATACTT 
GCAGGATTGCACAGCAGCCGGGCAGAGGCGCTCCTCACTTCCCAGATGGGGCGGCGGGCAGCAGAGACGCACCTC 
ACTTCCTAGACAGTGCGGCACCAGGCACAGGCACACCTCACTTCCCAGACAGTTGGGCGGCCCAGGCAAGCGCTC 
CTCACTTCCCCAGATGGGGCGGCTCCCGGGAAGCGGGGCTCCCTCACTTCCCAGACAGGGTGGCCAGGCAGAGGT 
GCTCCTCACTTCCCAGAACAATTCTTTATGAATTTGATAAAGGACTGAAGTGCAACTGAAAGCTGCTAGTGATGA 
TCTGGTAATATACAATTTGTCCAGTAGCCAGTTTGTTTTTATTGTGTTTTCTAACCATAAGAGATCATTAAAGGC 
AAAGCCTGTATGACGCTGTACACACACAAAAAAATGGTCACCGCAGGCCATACTACCAATGAAATGGTAGGTAAA 
CAAATCTTCTGGTCAAGAGAAAAAAAAAAGAAATAGCACTCTGCATGCTTTGCTCTACAAGATGAATTTCCCTAG 
AAAGAAXCCAATGAAGGCCGGGCATAGTGGCTCACT 
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FIGURE 78 

QTALYGKMGRVRSPHPYGHRKFITMSDGATSTFDLFEPLAEHCVGDDITMVICPGIATQFKSWI 
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FIGURE 79 

GAGATGCCTGAACCCAGCCTTGGGGAATCTCACAGCCCTCACCTACCTCAACCTCAGCCGAAACCAGCTGTCGCT 
GCTGCCACCCTACATCTGCCAGCTGCCCCTGAGGGTCCTCATCGTCAGCAACAACAAGCTGGGAGCCCTGCCCCC 
TGACATCGGCACCCTGGGAAGCCTGCGACAGCTTGACGTGAGCAGCAACGAGCTCCAATCCCTGCCCTCGGAACT 
GTGTGGCCTCTCTTCCCTGCGGGAGCTCAATGTCCGGAGGAACCAGCTCAGTACGCTGCCCGAAGAGCTGGGGGA 
CCTCCCTCTGGTCCGCCTGGATTTCTCCTGTAACCGCGTCTCCCGAATCCCAGTCTCCTTCTGCCGCCTGAGGCA 
CCTGCAGGTCATTCTGCTGGACAGCAACCCTCTGCAGAGTCCACCTGCCCAGGTCTGCCTGAAGGGGAAACTTCA 
CATCTTCAAGTATTTGTCCACAGAGGCCGGGCAGCGTGGGTCGGCCCTGGGGGACCTGGCCCCTTCTCGGCCCCC 
GAGTTTCAGTCCCTGCCCTGCAGAGGATCTATTTCCGGGACATCGGTACGATGGTGGGCTGGACTCAGGCTTCCA 
CAGCGTTGATAGTGGCAGCAAGAGGTGGTCTGGAAATGAGTCAACAGATGAATTTTCAGAGCTGTCATTCCGGAT 
CTCAGAGCXGGCCCGGGAGCCCCGGGGGCCCAGAGAACGCAAGGAGGATGGCTCAGCGGACGGAGACCCTGTGCA 
GATTGACTTCATCGACAGCCATGTCCCCGGGGAGGATGAAGAGCGAGGCACTGTGGAGGAGCAGCGACCACCCGA 
ATTAAGCCCTGGGGCAGGGGACAGGGAGAGGGCACCAAGCAGCAGGCGGGAGGAGCCGGCAGGGGAGGAGCGGCG 
GCGCCCGGACACCTTGCAGCTGTGGCAGGAGCGGGAACGGCGGCAGCAGCAGCAGAGCGGGGCGTGGGGGGCCCC 
GAGGAAGGATAGCCTCTTGAAGCCAGGGCTCAGGGCTGTTGTGGGAGGGGCCGCCGCCGTGTCCACTCAAGCCAT 
GCACAACGGCTCGCCTAAGTCCAGTGCCTCCCAAGCAGGGGCTGCAGCGGGGCAGGGAGCCCCCGCCCCTGCCCC 
TGCCTCCCAAGAGCCCCTTCCCATAGCTGGACCAGCGACAGCACCTGCTCCACGGCCACTTGGCTCCATTCAGAG 
ACCAAACAGCTTCCTCTTCCGTTCCTCCTCTCAGAGTGGCTCAGGCCCTTCCTCACCAGACTCTGTCCTGAGACC 
TCGGCGGTACCCCCAGGTTCCAGATGAGAAGGACTTAATGACTCAGCTGCGCCAGGTCCTTGAGTCCCGGCTGCA 
GCGGCCCCTGCCTGAGGACCTGGCCGAGGCTCTGGCCAGTGGGGTCATCCTGTGCCAGCTGGCCAACCAGCTACG 
GCCGCGCTCCGTGCCCTTCATCCATGTGCCCTCCCCTGCTGTGCCAAAACTCAGTGCCCTCAAGGCTCGGAAGAA 
TGTGGAGAGTTTTCTAGAAGCCTGTCGAAAAATGGGGGTGCCTGAGGCTGACCTGTGCTCGCCCTCGGATCTCCT 
CCAGGGCACTGCCCGGGGGCTGCGGACCGCGCTGGAGGCCATGAAGCGGGTGGGGGGCAAGGCCCTACCGCCCCT 
CTGGCCCCCCTCTGGTCTGGGCGGCTTCGTCGTCTTCTACGTGGTCCTCATGCTGCTGCTCTATGTCACCTACAC 
TCGGCTCCTGGGTTCCTAGGCCCCAAAATCGGCCCTCCCTCACCCCTTTCCCTTCCTCTCTATTTATAAGGTCCC 
TGCXCCACCCGACCCCACCTGCGGTGCCTTCAGCCCCAACCAAAGACACTAGTGCACCCCCTTCACAGACACTGA 
CCTCAGAGGCCCCACTCTGGTGCCCCCAGACCCTGGGCCCCCAGCCTCXGGCCTCCCXCCAGTAGCCCCACGAGT 
CCCCACCXCTCAGXGCXGACGGXGCCXXCAXGTCCCCGCCGGCCCXGCCCCXGCCCXCXGXACCCCGTGAGGGGX 
GGCAGGAGCXGGAGXCXCCCCCTTCCXCCTGTGCCCXCCCCXXCCCCCCCCAACAGCXGCXAXGGGGGGGCXAAA 
TXAXCXCXAXXXXGXAGAGAGGAXCXAXAXXXGXAGGGGXXCGGGGCCCAGGCCGGGXCCCXAXCXCXGXGXAXA 
AACXGTACAGACCGXGGCCGCCCXGCCTGXGXGXGXGXGXGTGCGCGCGCGCGCGCGTCTGCXCCGCGXGXXGGX 
GGCXGXGGCCAXGGCXCTGTGCCCACCAGCATCXCCCTCCXGAGATGCCGGCCTCXCAXGCTCCCGGAGCGXCCG 
CCAACCCCCCGXGXCACCXCCCXXCXGXXAXCGCXGACAGCXXXCXXGCGXCXCAXXXGXCGCCGAGCCCCGAGC 
GCACGGXGAXGCTCGGGXCXGCCCCCGACCCCCXGCCACAGGCCGGAAGCCGCAGGGGGCACCGXGGGGAAGCXA 
ACCCGGCCCCXXCCCCCAGGAGTCACTGTGCCAGCCCCACCACAXCCTGGAAGAGGAGGGGGCCCCGGGAAGGGG 
CCTCCCCXACAXCGCXGCXGXCGXCCACGCCCXGCXGGACCGGCCXXAGGGGXGAAGGXGGGGGCACAGGGCCCA 
CCCCGCCCGGGACCCCGGGAGCAGAAGACGCGCCXGGGCXCCGCGCXCXCAGAGAAGCACGTGGXGAGGGTGGCC 
XGGGCCTGGGCACCCXXGGCCXGCCGGCCXGGCXGCCTCXGGGGTCGAGGGTCTGGGXGGAAGGACCGCGGGXGX 
CGCAGGTGXCGXGXCAGCCGCAAXAAACAGAAGCCAGAAGCCCXCXGGGXGGCCCXCAGAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA2\AAAAAAAAA 
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FIGURE 80A 

CTGCCCACCATCTTTGTCCCTGGCAAAGTGGGTTTTGCGCAGTGGCTTAGACCTAGAAAAGAATCGTGACGGGCA 
GGAAACCATTACACCACCACCTGGGCTGTGCTCTCCGGCTCCCGCCGCCACCCCCGCCCTCGCCTTCGCCTCCGC 
TCCGGTGCACATTAAAGATCCAAAGTCATGACTGACTCCAAGTATTTCACAACCAATAAAAAAGGAGAAATATTT 
GAACTAAAAGCTGAACTCAACAATGAAAAGAAAGAAAAGAGAAAGGAGGCTGTGAAGAAAGTGATTGCTGCXATG 
ACCGTGGGGAAGGATGTTAGTTCTCTCTTTCCAGACGTAGTGAACTGTATGCAGACTGACAATCTGGAACTAAAG 
AAGCTTGTGTATCTCTACTTGATGAACTACGCCAAGAGTCAGCCAGACATGGCCATCATGGCTGTAAACAGCTTT 
GTGAAGGACTGTGAAGATCCTAATCCTTTGATTCGAGCCTTGGCAGXCAGAACCATGGGGTGCATCCGGGTAGAC 
AAAATTACAGAATATCTCTGTGAGCCGCTCCGCAAGTGCTTGAAGGATGAGGATCCCTATGTTCGGAAAACAGCA 
GCAGTCTGCGTGGCAAAACTCCATGATATCAATGCCCAAATGGTGGAAGATCAGGGATTTCTGGATTCTCTACGG 
GATCTCATAGCAGATTCAAATGCAATGGTGGTGGCTAATGCCGTAGCGGCATTATCTGAAATCAGTGAGXCTCAC 
CCAAACAGCAACTTACTTGATCTGAACCCACAGAACATTAATAAGCTGCTGACAGCCCTGAATGAATGCACTGAA 
TGGGGCCAGATTTTCATCCTGGACTGCCTGTCTAATTACAACCCTAAAGATGATCGGGAGGCTCAGAGCATCTGT 
GAGCGGGTAACTCCCCGGCTATCCCATGCCAACTCAGCAGTGGTGCTTTCAGCGGTAAAAGXCCXAATGAAGTTT 
CTAGAATTGXXACCXAAGGATTCXGACXACTACAATATGCXGCTGAAGAAGXXAGCCCCTCCACTXGTCACXTTG 
CTGTCTGGGGAGCCAGAAGTGCAGXAXGXCGCCCTGAGGAACATCAACTTAAXXGTCCAGAAAAGGCCXGAAATC 
XTGAAGCAGGAAATCAAAGXCTXCTTTGTGAAGTACAAXGATCCCATCXAXGXXAAACXAGAGAAGXTGGACAXC 
ATGATTCGTXXGGCATCTCAAGCCAACAXTGCTCAGGTTCXGGCAGAACXGAAAGAATATGCXACAGAGGXGGAT 
GXTGACTTXGXXCGAAAAGCXGTGCGGGCCAXXGGACGGTGXGCCATCAAGGXGGAGCAATCXGCAGAGCGCTGT 
GTAAGCACATTGCTTGAXCTAATCCAGACCAAAGXGAATTATGTGGTCCAAGAAGCAAXXGTTGXCATCAGGGAC 
AXCXTCCGCAAATACCCCAACAAGTATGAAAGXAXCAXCGCCACXCTGTGTGAGAACTXAGACXCGCXGGAXGAG 
CCAGATGCXCGAGCAGCTATGAXXTGGATTGTGGGAGAAXATGCTGAAAGAAXXGACAATGCAGAXGAGXXACTA 
GAAAGCTTCCTGGAGGGTTTTCACGATGAAAGCACCCAGGTGCAGCTCACTCTGCXTACXGCCATAGTGAAGCTG 
TTTCTCAAGAAACCAXCAGAAACACAGGAGCXAGTCCAGCAGGTCXXGAGTTXGGCAACACAGGAXXCXGATAAT 
CCTGACCXTCGAGACCGGGGCXAXATTXAXXGGCGCCXXCTCTCAACXGACCCTGTXACAGCXAAAGAAGTAGXC 
TTGTCTGAGAAGCCACTGAXCXCTGAGGAGACGGACCTTATTGAGCCAACTCTGCTGGAXGAGCTAATCTGCCAC 
AXXGGTTCTXXGGCCXCTGXGTATCAXAAGCCTCCCAATGCXXXTGTGGAAGGAAGTCATGGAAXTCAXCGXAAA 
CACXTGCCAATTCATCAXGGGAGCACTGAXGCAGGTGACAGCCCXGTXGGCACTACCACXGCAACGAACCTGGAA 
CAGCCTCAGGTTATCCCCXCXCAAGGTGATCXTCXAGGGGATCTTTXAAACCTTGACCTCGGTCCCCCAGTCAAT 
GXGCCACAGGXGXCCXCCAXGCAGAXGGGAGCAGXGGAXCXCCXAGGAGGAGGACXAGATAGXCXGGXGGGACAA 
TCCTTCATCCCAXCATCGGTGCCXGCAACCXTTGCTCCXXCACCTACACCXGCTGTGGTCAGCAGXGGACTGAAX 
GACCTGTTTGAACXCTCCACAGGGATAGGCAXGGCACCTGGTGGAXATGTGGCTCCTAAGGCTGTCTGGCXACCX 
GCAGTAAAGGCXAAAGGCTTGGAGAXXXCCGGAACAXXXACTCACCGCCAAGGGCACATCXAXAXGGAAAXGAAC 
XXCACCAATAAAGCTCTGCAGCACAXGACAGATTTXGCAATCCAGXXXAACAAAAAXAGCXTXGGXGTCATCCCC 
AGCACTCCTCTGGCCATCCATACACCACTGATGCCAAACCAGAGCATTGATGXCTCCCTGCCTCTCAATACCTTG 
GGCCCAGXCAXGAAGAXGGAACCXCXGAAXAACCXCCAGGTGGCXGXGAAAAACAAXAXCGAXGXCTTCXACXXC 
AGCXGCCXCAXCCCACTCAATGXGCTTXXTGTAGAAGAXGGCAAAATGGAGCGCCAGGTCTXCCXXGCAACATGG 
AAGGATATTCCCJ^XGAAAAXGAACTXCAGTTTCAGATXAAGGAATGTCATTTAAAXGCXGACACTGXXXCCAGC 
AAGTTGCAAAACAACAATGTXTATACTAXTGCCAAGAGGAATGXGGAAGGGCAGGACATGCXGXACCAATCCCTG 
AAGCTCACXAATGGCATXXGGAXXXXGGCCGAACXACGXAXCCAGCCAGGAAACCCCAAXXACACGCTGTCACXG 
AAGTGXAGAGCTCCTGAAGTCTCXCAATACAXCTAXCAGGTCTACGACAGCATTTTGAAAAACJCAAPAAGACTGG 
XCCAGXACCCXXCAACCATGCXGXGATCGGTGCAAGXCAAGAACTCXXAACTGGAAGAAAXXGXAXXGCTGCGTA 
GAAXCTGAACACACXGAGGCCACCTAGCAAGGTAGTAACXAGXCXAACCXGXGCXAACATXAGGGCACAACCXGX 
TGGATAGXXXXAGCTTCCTGXGAACAXTTGTAACCACTGCTTCAGXCACCXCCCACCTCXXGCCACCTGCXGCXG 
CTATCTGTCCTTACTTGTGGGCXXCTCCATGCTGTGCCAATGGCTGGCTXXTTCTACACCCXCXTXTGAGXGXAG 
TXXGGXAXXXTGXAATXGAGAGCTCAXXXCAAAAGCAGAAAAAGACAACAAATAXTAAAGCAAGGAAAAGXGXAA 
CXGAAACACXGCACXTTACTGTTTTATACTXXXGTACATAXGAGAAAXC2U^GGGATXAGTGCAACCAGTAGAAGG 
CATXGAAAXGACTGTCATTAACCACACAGTCCTGGAGGCAGAGATGCAGTXACCXACCCTAGCTXTXGAXGGGXX 
CXCTXACCXGXAGTAGCCXTATCCCXGGXCAXTXGGAXXXXCAGXXXGCXXTXXTCXXXXXXXCCCCXCCAAACX 
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FIGURE SOB 

CCTTTTCCTTGGCCAAGCCTTCATGCTTCCCCCTTTCCATATTATAATCTCATTTGATTGCTCTGCAGTTGGGAA 
CGGTGATCTTCTTGAATGATGTTTCAGTGTGCAAAAACTATAGAGCCTGTCAGCACCAAAGCTGACAGAAGTTAT" 
ACCTTACTCCTTTCCTTTCCCCTGAACAAACCTGCTAATCCCACTAATTCAGGAATTTGAGTAGAGATGGGGAAC 
AAGAACCCAGATGCTGTCCCCTCACCCCCTCTCCTGTATTTCTCAGGTCCAGTTCAAATCTAAAATXCTACTTTT 
AGAGTTGAAACAGAGTAATAACTTATCTAACCCTCTTTTCCTACAAAGGAGAAAGATAAAAGGCACAAAGGTTAC 
CGCCAAGGCCCGTCAGCTGTGTAGTGGCAAAGCCGAGACCGAGTCTCCTAAGTCCCCGTCAGTGTGGTTTTCACC 
ACAGGACXGXCTCTTGTCGTTTTCCCCTAATGCCTTCTCCTGCCTTTTCTGTGCCTAGTTTTTGGCTCTTCACAT 
ATTCCATATTGATTTTGACGCTCTGTATATTGGCATCAGGTGGCAGCTGAATATCTTTTGAATTACTCGAAGGTA 
AAGCCAGATGCCAGAATGAAGGTGTAGCCAGTGTTTCCCATATGCCCCTGGAGCCCCACTTATTGAGGCCAGCAG 
AATAGGTGCAGAGATGAAGTGAGCTTAGAGATGTTGCAAATGCTCTTTATCCCTTCAGCTCTCXGATCTGCTCTT 
XCXXCAXGAXACTXAGTCTGCAGGGCATATTAAGATCATCCCAGAGGXTCAGGCAGTXCCTGTCAXCXCXGAAAA 
GACXGGGGGATATGAAAXCTTCCCCCTACCCCACXTAAXGCGXXGGAXAXGAXTTXTCAAAGAAXGCTTCAXGCC 
CAAAATACCAGCCXGXXXAGCAGXGXTACACXGXXXGAXCXGCGGGCACXXGTTGCAXXGCCXGGCACCCAAXAT 
TCAGGGXCCAXGACTAAGACTGGTCXXCTCAGAXGCCCTGCTTAAATCAGGGGCACTTCAGGCXCCACAGGCGTC 
ATGTTGGACXGAGACCXAACXCACTGGACTCAGAGGAGGAATCGTGGAAAACAAGAGCAAAACTACCCCACACCC 
CXAXXTCAXGXCXGAAAXAACCCTGXXXCAXACCAGXXGCAAAGCXXGXGGGGAGCGGXCCCACAAAGCACXXXC 
TTAAACCXXGAGAATCTCCAAGAGAAAAATATXXGGGGAAGGAGGGAGGAAAXAXGTCCCTXGCACACCACCCCX 
GAAGCACAXGGCAGTAGGAAACAGCATAGGATXGTAXGTGGGAGGXGGAXAGGTCGGTGATGXGXGGAGCGGAAA 
AGCAGGTXGGTAAAGXXCCCTXCTTGGGACXXATTCCTGGAGXCAGTGGATACAAGXAGXGCAGAAGGXXCACAC 
TGCAAATAGTGXTCXCAXCTCAAAGCAAACXATCAXXCCAGAAGGAAAAGXGTGTCAGGGCAAGCAGACAACACA 
AXTXCCXAXCAGAATATGTCCCXCAACCCCCGAAACAAGGCTTCTCXCAGCCXCCCCACCAGXGAXGGATAACAG 
CXCCTATXCTCAGCXGACCXGACTGAGCCAACCCAXGAACTCXXCACTCCTTGGGGAAGCCACCXCCCAXCACAC 
CCCXGAGCAGAGXXAGGGAGGAAXXCXACXXCCCATAAAAGGACCXCTCCTGAGAGGCAAAACCXGXXGCCTCCA 
CCACGGCTTCCCTCXTGGCXCATTCCAAGCTXGGCCAAATXGGGGAAGXGGGATGGAGGXXGCCCTGCATCCCCC 
CTCCXCXGCCTGAGTGXGXCXTTGTAATGTCAGCTGGCATCAXACAAAGAGCAGGAGAAGCAAACACCCAGAACX 
CXTTTGCTGGXCAGAGAXXCCCXGAGXGTCXGXCCXCACCCAAGCCTGCXCXGXGTCXGXGXXGXGAAGCTTGAG 
ACXCTGGAAAGAAAXGGGGAGGGGGGGCAGGGGAAATGTTGCCCXAAGAATGCTTCXCAXTCCXCTGXTCXXATT 
GGGXCCTGTTTTTCGGGAGGGTGGGGGXXGGGGGAAGCTTGACCXTGTGTCTXCGXCAAXAAACTCACAXXXACA 
C 
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FIGURE 81 

MTDSKYFTTNKKGEIFELKAELNNEKKEKRKEAVKKVIAAMTVGKDVSSLFPDVVNCMQTDNLELKKLVYLYLMN 
YAKSQPDMAIMAVNSFVKDCEDPNPLIRALAVRTMGCIRVDKITEYLCEPLRKCLKDEDPYVRKTAAVCVAKLHD 

INAQMVEDQGFLDSLRDLIADSNPMWANAVAALSEISESHPNSNLLDLNPQNINKLLTALNECTEWGQIFILDC 
LSNYNPKDDREAQSICERVTPRLSHANSAWLSAVKVLMKFLELLPKDSDYYNMLLKKLAPPLVTLLSGEPEVQY 
VALRNINLIVQKRPEILKQEIKVFFVKYNDPIYVKLEKLDIMIRLASQANIAQVLAELKEYATEVDVDFVRKAVR 
AIGRCAIKVEQSAERCVSTLLDLIQTKVNYWQEAIWIRDIFRKYPNKYESIIATLCENLDSLDEPDARAAMIW 
IVGEYAERIDNADELLESFLEGFHDESTQVQLTLLTAIVKLFLKKPSETQELVQQVLSLATQDSDNPDLRDRGYI 
YWRLLSTDPVTAKEWLSEKPLISEETDLIEPTLLDELICHIGSLASVYHKPPNAFVEGSHGIHRKHLPIHHGST 
DAGDSPVGTTTATNLEQPQVIPSQGDLLGDLLNLDLGPPVNVPQVSSMQMGAVDLLGGGLDSLVGQSFIPSSVPA 
TFAPSPTPAWSSGLNDLFELSTGIGMAPGGYVAPKAVWLPAVKAKGLEISGTFTHRQGHIYMEMNFTNKALQHM 
TDFAIQFNKNSFGVIPSTPLAIHTPLMPNQSIDVSLPLNTLGPVMKMEPLNNLQVAVKNNIDVFYFSCLIPLNVL 
FVEDGKMERQVFLATWKDIPNENELQFQIKECHLNADTVSSKLQNNNVYTIAKRNVEGQDMLYQSLKLTNGIWIL 
AELRIQPGNPNYTLSLKCRAPEVSQYIYQVYDSILKN 
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FIGURE 82 

GGCACGAGGGCGCGGAGCGGAGCGGCGGCGGGCGCAGCTAGCGGGTCGGCCGCGGAGCGGAGGTGCAGCTCGGCT 
TCCCCCGGCACCCCTCCCCCTCGGGCGCCAGCCCCACCCCTCCGCCGGCCGGGCCGACCCCGCCGTACTATCCCC 
TGCGGCGCGAGCCCGGGGCGGCTCCAAGCGCCCCCCAGCAGACCCCCATCATGGGCAGCCAGAGCTCCAAGGCTC 
CCCGGGGCGACGTGACCGCCGAGGAGGCAGCAGGCGCTTCCCCCGCGAAGGCCAACGGCCAGGAGAATGGCCACG 
TGAAAAGCAATGGAGACTTATCCCCCAAGGGTGAAGGGGAGTCGCCCCCTGTGAACGGAACAGATGAGGCAGCCG 
GGGCCACTGGCGATGCCATCGAGCCAGCACCCCCTAGCCAGGGTGCTGAGGCCAAGGGGGAGGTCCCCCCCAAGG 
AGACCCCCAAGAAGAAGAAGAAATTCTCTTTCAAGAAGCCTTTCAAATTGAGCGGCCTGTCCTTCAAGAGAAATC 
GGAAGGAGGGTGGGGGTGATTCTTCTGCCTCCTCACCCACAGAGGAAGAGCAGGAGCAGGGGGAGATCGGTGCCT 
GCAGCGACGAGGGCACTGCTCAGGAAGGGAAGGCCGCAGCCACCCCTGAGAGCCAGGAACCCCAGGCCAAGGGGG 
CAGAGGCTAGTGCAGCCTCAGAAGAAGAGGCAGGGCCCCAGGCTACAGAGCCATCCACTCCCTCGGGGCCGGAGA 
GTGGCCCTACACCAGCCAGCGCTGAGCAGAATGA GTAGC TAGGTAGGGGCA6GTGGGT6ATCTCTAAGCTGCAAA 
AACTGTGCTGTCCTTGTGAGGTCACTGCCTGGACCTGGTGCCCTGGCTGCCTTCCTGTGCCCAGAAAGGAAGGGG 
CTATTGCCTCCTCCCAGCCACGTTCCCTTTCCTCCTCTCCCTCCTGTGGATTCTCCCATCAGCCATCTGGTTCTC 
CTCTTAAGGCCAGTTGAAGATGGTCCCTTACAGCTTCCCAAGTTAGGTTAGTGATGTGAAATGCTCCTGTCCCTG 
GCCCTACCTCCTTCCCTGTCCCCACCCCTGCATAAGGCAGTTGTTGGTTTTCTTCCCCAATTCTTXTCCAAGTAG 
GTTTTGTTTACCCTACTCCCCAAATCCCTGAGCCAGAAGTGGGGTGCTTATACTCCCAAACCTTGAGTGTCCAGC 
CTTCCCCTGTTGTTTTTAGTCTCTTGTGCXGTGCCTAGTGGCACCTGGGCTGGGGAGGACACTGCCCCGTCTAGG 
TTTTTATAAATGTCTTACTCAAGTTCAAACCTCCAGCCTGTGAATCAACTGTGTCTCTTTTTTGACTTGGTAAGC 
AAGTATTAGGCTTTGGGGTGGGGGGAGGTCTGTAATGTGAAACAACTTCTTGTCTTTTTTTCTCCCACTGTTGTA 
AATAACTTTTAATGGCCAAACCCCAGATTTGTACTTTTTTTTTTTTCTAACTGCTAAAACCATTCTCTTCCACCT 
GGTTTTACTGTAACATTTGGAAAAGGAATAAATGTCGTCCCTTTTTAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 83 

MGSQSSKAPRGDVTAEEAAGASPAKANGQENGHVKSNGDLSPKGEGESPPVNGTDEAAGATGDAIEPAPPSQGAE 
AKGEVPPKETPKKKKKFSFKKPFKLSGLSFKRNRKEGGGDSSASSPTEEEQEQGEIGACSDEGTAQEGKAAAXPE 
SQEPQAKGAEASAASEEEAGPQATEPSTPSGPESGPTPASAEQNE 
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FIGURE 84 

GCGGAGCGTGTGAGCAGTACXGCGGCCTCCTCTCCTCTCCTAACCTCGCTCTCGCGGCCTACCTTTACCCGCCCG 

CCTGCTCGGCGACCAGCGGGGATCCTCCCCCAGCCGCAAGTCCACGAAGAAAGCAACGAATGAAAATTATGAAGA 

CAACGAGAAGTCAGACTCCTCCGGGTCGCGCTCCAGCTGCTTCGGCTTCGTCGCCTACTCTGTGAACTCCGGGGA 

GAGATCTCGAGTCAAGATTAAGACCTTAACCCACCAACCTGCCTGTTCGGACACCCCCCGGGCCGGCCGCTGTCT 

GTCCCCTTCTCCATCGCCCTCTCCCAGAAAGCTCCGGTGCTTGGACCAGCTAGAGTCTGAGAAAGAGGAGAGGCG 

CGAACGCCACTCCAAAAAGAGAAGGGTTAAAGAGGGCAACCCTAACGATACGCTTGACTTTCTGTGGCTGGGAAC 

ACCTTCCACC ATGA CCACCTCAGCAAGTTCCCACTTAAATAAAGGCATCAAGCAGGTGTACATGTCCCTGCCTCA 

GGGTGAGAAAGTCCAGGCCATGTATATCTGGATCGATGGTACTGGAGAAGGACTGCGCTGCAAGACCCGGACCCT 

GGACAGTGAGCCCAAGTGTGTGGAAGAGTTGCCTGAGTGGAATTTCGATGGCTCCAGTACTTTACAGTCTGAGGG 

TTCCAACAGTGACATGTATCTCGTGCCTGCTGCCATGTTTCGGGACCCCTTCCGTAAGGACCCTAACAAGCTGGT 

GTTATGTGAAGTTTTCAAGTACAATCGAAGGCCTGCAGAGACCAATTTGAGGCACACCTGTAAACGGATAAXGGA 

CATGGTGAGCAACCAGCACCCCTGGTTTGGCATGGAGCAGGAGTATACCCTCATGGGGACAGATGGGCACCCCTT 

TGGTTGGCCTTCCAACGGCTTCCCAGGGCCCCAGGGTCCATATTACTGTGGTGTGGGAGCAGACAGAGCCTATGG 

CAGGGACATCGTGGAGGCCCATTACCGGGCCTGCTTGTATGCTGGAGTCAAGATTGCGGGGACTAATGCCGAGGT 

CATGCCTGCCCAGTGGGAATTTCAGATTGGACCTTGTGAAGGAATCAGCATGGGAGATCATCTCTGGGTGGCCCG 

TTTCATCTTGCATCGTGTGTGTGAAGACTTTGGAGTGATAGCAACCTTTGATCCXAAGCCCATTCCTGGGAACTG 

GAATGGTGCAGGCTGCCATACCAACTTCAGCACCAAGGCCATGCGGGAGGAGAATGGTCTGAAGTACATCGAGGA 

GGCCATTGAGAAACTAAGCAAGCGGCACCAGTACCACATCCGTGCCTATGATCCCAAGGGAGGCCTGGACAATGC 

CCGACGTCTAACTGGATTCCATGAAACCTCCAACATCAACGACTTTTCTGCTGGTGTAGCCAATCGTAGCGCCAG 

CATACGCATTCCCCGGACTGTTGGCCAGGAGAAGAAGGGTTACTTTGAAGATCGTCGCCCCTCTGCCAACTGCGA 

CCCCTTTTCGGTGACAGAAGCCCTCATCCGCACGTGTCTTCTCAATGAAACCGGCGATGAGCCCTTCCAGTACAA 

AAAT TAA GTGGACTAGACCTCCAGCTGTTGAGCCCCTCCTAGTTCTTCATCCCACTCCAACTCTTCCCCCTCTCC 

CAGTTGTCCCGATTGTAACTCAAAGGGTGGAATATCAAGGTCGTTTTTTTCATTCCATGTGCCCAGTTAATCTTG 

CTTTCTTTGTTTGGCTGGGATAGAGGGGTCAAGTTATTAATTTCTTCACACCTACCCTCCTTTTTTTCCCTATCA 

CTGAAGCTTTTTAGTGCATTAGTGGGGAGGAGGGTGGGGAGACATAACCACTGCTTCCATTTAATGGGGTGCACC 

TGTCCAATAGGCGTAGCTATCCGGACAGAGCACGTTTGCAGAAGGGGGACTCTTCTTCCAGGTAGCTGAAAGGGG 

AAGACCTGACGTACTCTGGTTAGGTTAGGACTTGCCCTCGTGGTGGAAACTTTTCTTAAAAAGTTATAACCAACX 

XTXCTATTAAAAGXGGGAATTAGGAGAGAAGGXAGGGGTTGGGAAXCAGAGAGAATGGCXTXGGTCXCXTGCTXG 

XGGGACXAGCCXGGCTTGGGACTAAATGCCCTGCXCTGAACACGAAGCXXAGXAXAAACTGATGGATATCCCTAC 

CTXGAAAGAAGAAAAGGXXCTTACTGCXXGGXCCXXGATTXATCACACAAAGCAGAAXAGXAXXXTXAXAXTXAA 

ATGTAAAGACAAAAAACTATATGTATGGTXXTGTGGAXXATGXGXGTTTTGCTAAAGGAAAAAACCATCCAGGTC 

AC GGGGCAC CAAAX T T GAGACAAAXAGXCGGATX AGAAAXAAAGCAX C X CAX X X XGAGX AGAGAGC AAGGGAAGX 

GGXXCXXAGAXGGXGATCXGGGAXXAGGCCCXCAAGACCCXXXXGGGXXXCXGCCCXGCCCACCCXCXGGAGAAG 

GXGGGCACXGGAXXAGXXAACAGACGACACGXXACTAGCAGXCACXXGATCXCCGTGGCXTTGGTTXAAAAGACA 

CACXXGXCCACAXAGGXTXAGAGAXAAGAGXXGGCTGXXCAACXXGAGCAXGXTACTGACAGAGGGGGXATXGGG 

GXXATXXXCTGGXAGGAAXAGCAXGXCACXAAAGCAGGCCXXXXGAXAXXAAAXXXXXXAAAAAGCAAAAXXAXA 

GAAGTTXAGATXXTAAXCAAAXXXGXAGGGXXXCXAGGXAAXXXXXACAGAAXTGCXXGXTXGCTTCAACTGTCT 

CCXACCXCXGCXCXXGGAGGAGATGGGGACAGGGCTGGAGXCAAAACACXXGXAAXXXXGXAXCXXGAXGXCXXT 

GXXAAGACXGCXGAAGAAXXAXXXXXXXXCXXXXAXAAXAAGGAATAAACCCCACCXXXAXXCCXXCAXXXCATC 

XACCAXXXXCXGGXXCXXGXGXXGGCXGXGGCAGGCCAGCXGXGGXXXXCXTXXGCCAXGACAACTXCXAAXXGC 

CAXGXACAGXAXGXXCAAAGXCAAAXAACTCCXCATXGXAAACAAACXGTGXAACXGCCCAAAGCAGCACXXAXA 

AAXCAGCCXAACAXAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 85 

MTTSASSHLNKGIKQVYMSLPQGEKVQAMYIWIDGTGEGLRCKTRTLDSEPKCVEELPEWNFDGSSTLQSEGSNS 
DMYLVPAAMFRDPFRKDPNKLVLCEVFKYNRRPAETNLRHTCKRIMDMVSNQHPWFGMEQEYTLMGTDGHPFGWP 
SNGFPGPQGPYYCGVGADRAYGRDIVEAHYRACLYAGVKIAGTNAEVMPAQWEFQIGPCEGISMGDHLWVARFIL 
HRVCEDFGVIATFDPKPIPGNWNGAGCHTNFSTKAMREENGLKYIEEAIEKLSKRHQYHIRAYDPKGGLDNARRL 
TGFHETSNINDFSAGVANRSASIRIPRTVGQEKKGYFEDRRPSANCDPFSVTEALIRTCLLNETGDEPFQYKN 
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FIGURE 86 

AGTATGTGTGGTTGGGGAATTCATGTGGAGGTCAGAGTGGAAGCAGGTGTGAGAGGGTCCAGCAGAAGGAAACAT 
GGCTGCCAAAGTGTTTGAGTCCATTGGCAAGTTTGGCCTGGCCTTAGCTGTTGCAGGAGGCGTGGTGAACTCTGC 
CTTATATAATGTGGATGCTGGGCACAGAGCTGTCATCTTTGACCGATTCCGTGGAGTGCAGGACATTGTGGTAGG 
GGAAGGGACTCATXTTCTCATCCCGTGGGTACAGAAACCAATTATCTTTGACTGCCGTTCTCGACCACGTAATGT 
GCCAGTCATCACTGGTAGCAAAGATTXACAGAATGTCAACATCACACTGCGCATCCTCTTCCGGCCTGTCGCCAG 
CCAGCTTCCTCGCATCTTCACCAGCATCGGAGAGGACTATGATGAGCGTGTGCTGCCGTCCATCACAACTGAGAT 
CCTCAAGTCAGTGGTGGCTCGCTTXGATGCTGGAGAACXAATCACCCAGAGAGAGCTGGTCTCCAGGCAGGTGAG 
CGACGACCTTACAGAGCGAGCCGCCACCtXXGGGCTCAXCCXGGATGACGTGXCCXXGACACAXCXGACCXXCGG 
GAAGGAGTTCACAGAAGCGGXGGAAGCCAAACAGGTGGCTCAGCAGGAAGCAGAGAGGGCCAGATTTGTGGXGGA 
AAAGGCTGAGCAACAGAAAAAGGCGGCCAXCAXCXCXGCXGAGGGCGACXCCAAGGCAGCXGAGCXGAXXGCCAA 
CTCACTGGCCACXGCAGGGGAXGGCCXGATCGAGCXGCGCAAGCTGGAAGCXGCAGAGGACAXCGCGTACCAGCX 
CTCACGCTCTCGGAACATCACCTACCTGCCAGCGGGGCAGTCCGXGCXCCXCCAGCXGCCCCAGTGAGGGCCCAC 
CCXGCCXGCACCXCCGCGGGCXGACXGGGCCACAGCCCCGAXGAXXCXXAACACAGCCTXCCXXCXGCTCCCACC 
CCAGAAATCACTGXGAAATTTCAXGATXGGCTTAAAGXGAAGGAAAXAAAGGTAAAAXCACTTCAGAXCXCXAAT 
XAGTCTAXCAAAXGAAACTCTTTCAXXCTTCTCACAXCCATCTACXTTTTTATCCACCTCCCTACCAAAAATTGC 
CAAGXGCCTAXGCAAACCAGCTXTAGGXCCCAAXTCGGGGCCXGCXGGAGXXCCGGCCXGGGCACCAGCAXXXGG 
CAGCACGCAGGCGGGGCAGXAXGXGATGGACTGGGGAGCACAGGXGXCTGCCTAGAXCCACGTGTGGCCXCCGTC 
CXGXCACXGATGGAAGGTTTGCGGAXGAGGGCAXGXGCGGCTGAACXGAGAAGGCAGGCCTCCGTCXXCCCAGCG 
GTTCCTGXGCAGATGCXGCXGAAGAGAGGTGCCGGGGAGGGGCAGAGAGGAAGTGGXCXGXCXGXTACCAXAAGX 
CXGAXXCXCXXXAACXGXGXGACCAGCGGAAACAGGXGXGXGXGAACXGGGCACAGATTGAAGAATCTGCCCCXG 
XXGAGGTGGGXGGGCCTGACXGXXGCCCCCCAGGGXCCTAAAACXXGGATGGACTXGXAXAGTGAGAGAGGAGGC 
CXGGACCGAGATGTGAGXCCTGTTGAAGACTTCCTCTCTACCCCCCACCXXGGTCCCXCTCAGATACCCAGXGGA 
AXXCCAACXXGAAGGAXXGCAXCCTGCXGGGGCXGAACAXGCCXGCCAAAGACGTGTCCGACCXACGTXCCXGGC 
CCCCTCGTXCAGAGACXGCCCXTCTCACGGGCXCXATGCCTGCACXGGGAAGGAAACAAATGXGXAXAAACTGCX 
GXCAATAAAXGACACCCAGACCTTCC 
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FIGURE 87 

MAAKVFESIGKFGLALAVAGGWNSALYNVDAGHRAVIFDRFRGVQDIWGEGTHFLIPWVQKPIIFDCRSRPRN 
VPVITGSKDLQNVNITLRILFRPVASQLPRIFTSIGEDYDERVLPSITTEILKSVVARFDAGELITQRELVSRQV 
SDDLTERAATFGLILDDVSLTHLTFGKEFTEAVEAKQVAQQEAERARFWEKAEQQKKAAIISAEGDSKAAELIA 
NSLATAGDGLIELRKLEAAEDIAYQLSRSRNITYLPAGQSVLLQLPQ 
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FIGURE 88 

GGCACGAGGGGCCGGGGGGCGCAGCTAGAGAGCCCCGGAGCCGCGGCGGGAGAGGAACGCGCAGCCAGCCTTGGG 
AAGCCCAGGCCCGGCAGCC ATGG CGGTGGAAGGAGGAATGAAATGTGTGAA6TTCTTGCTCTACGTCCTCCTGCT 
GGCCTTTTGCGCCTGTGCAGTGGGACTGATTGCCGTGGGTGTCGGGGCACAGCTTGTCCTGAGTCAGACCATAAT 
CCAGGGGGCTACCCCTGGCTCTCTGTTGCCAGTGGTCATCATCGCAGTGGGTGTCTTCCTCTTCCTGGTGGCTTT 
TGTGGGCTGCTGCGGGGCCTGCAAGGAGAACTATTGTCTTATGATCACGTTTGCCATCTTTCTGTCTCTTATCAT 
GTTGGTGGAGGTGGCCGCAGCCATTGCTGGCTATGTGTTTAGAGATAAGGTGATGTCAGAGTTTAATAACAACTT 
CCGGCAGCAGATGGAGAATTACCCGAAAAACAACCACACTGCTTCGATCCTGGACAGGATGCAGGCAGATTTTAA 
GTGCTGTGGGGCTGCTAACTACACAGATTGGGAGAAAATCCCTTCCATGTCGAAGAACCGAGTCCCCGACTCCTG 
CTGCATTAATGTTACTGTGGGCTGTGGGATTAATTTCAACGAGAAGGCGATCCATAAGGAGGGCTGTGTGGAGAA 
GATTGGGGGCTGGCTGAGGAAAAATGTGCTGGTGGTAGCTGCAGCAGCCCTTGGAATTGCTTTTGTCGAGGTTTT 
GGGAATTGTCTTTGCCTGCTGCCTCGTGAAGAGTATCAGAAGTGGCTACGAGGTGATG5CAGGGGTCTGGTCTCCT 
CAGCCTCCTCATCTGGGGGAGTGGAATAGTATCCTCCAGGTTTTTCAATTAAACGGATTATTTTTTCAGACCGAA 
AAGAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 89 

MAVEGGMKCVKFLLYVLLLAFCACAVGLIAVGVGAQLVLSQTIIQGATPGSLLPWIIAVGVFLFLVAFVGCCGA 
CKENYCLMITFAIFLSLIMLVEVAAAIAGYVFRDKVMSEFNNNFRQQMENYPKNNHTASILDRMQADFKCCGAAN 
YTDWEKIPSMSKNRVPDSCCINVTVGCGINFNEKAIHKEGCVEKIGGWLRKNVLWAAAALGIAFVEVLGIVFAC 
CLVKSIRSGYEVM 
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FIGURE 90 

CGGGAGAGCGCGCTCTGCCTGCCGCCT6CCTGCCT6CCACTGAGGGTTCCCAGCACC ATGA .GG6CCTGGATCTTC 
XTTCTCCTTTGCCTGGCCGGGAGGGCCTTGGCAGCCCCTCAGCAAGAAGCCCTGCCTGATGAGACAGAGGTGGTG 

GAAGAAACTGTGGCAGAGGTGACTGAGGTATCTGTGGGAGCTAATCCTGTCCAGGTGGAAGTAGGAGAATTTGAT 
GATGGTGCAGAGGAAACCGAAGAGGAGGTGGTGGCGGAAAATCCCTGCCAGAACCACCACTGCAAACACGGCAAG 
GTGTGCGAGCTGGATGAGAACAACACCCCCATGTGCGTGTGCCAGGACCCCACCAGCTGCCCAGCCCCCATTGGC 
GAGTTTGAGAAGGTGTGCAGCAATGACAACAAGACCTTCGACTCTTCCTGCCACTTCTTTGCCACAAAGTGCACC 
CTGGAGGGCACCAAGAAGGGCCACAAGCTCCACCTGGACTACATCGGGCCTTGCAAATACATCCCCCCTTGCCTG 
GACTCTGAGCTGACCGAATTCCCCCTGCGCATGCGGGACTGGCTCAAGAACGTCCTGGTCACCCTGTATGAGAGG 
GATGAGGACAACAACCTTCTGACTGAGAAGCAGAAGCTGCGGGTGAAGAAGATCCATGAGAATGAGAAGCGCCXG 
GAGGCAGGAGACCACCCCGTGGAGCTGCTGGCCCGGGACTTCGAGAAGAACTAXAACATGTACATCTTCCCTGTA 
CACTGGCAGTTCGGCCAGCTGGACCAGCACCCCATTGACGGGTACCTCTCCCACACCGAGCTGGCTCCACTGCGT 
GCTCCCCTCATCCCCATGGAGCATTGCACCACCCGCTTTTTCGAGACCTGTGACCTGGACAATGACAAGTACATC 
GCCCTGGATGAGTGGGCCGGCTGCXTCGGCATCAAGCAGAAGGATATCGACAAGGAXCTTGTGATCTAAATCCAC 
TCCTTCCACAGTACCGGATXCTCTCTTTAACCCTCCCCXTCGTGXTTCCCCCAAXGTTTAAAAXGTXTGGAXGGX 
TTGTTGTTCTGCCTGGAGACAAGGTGCTAACATAGAXXXAAGTGAATACATTAACGGTGCTAAAAATGAAAAXXC 
XAACCCAAGACAXGACAXXCXXAGCXGXAACXXAACTATTAAGGCCXXXXCCACACGCAXXAATAGTCCCATTTT 
TCTCXTGCCATTTGTAGCTTTGCCCAXTGTCTTATTGGCACATGGGXGGACACGGATCTGCXGGGCXCTGCCXXA 
AACACACATTGCAGCTTCAACTTTTCTCTTTAGTGTTCTGTXXGAAACTAATACTTACCGAGTCAGACTTTGTGT 
XCAXXXCAXTXCAGGGXCXXGGCXGCCXGXGGGCXXCCCCAGGXGGCCTGGAGGXGGGCAAAGGGAAGXAACAGA 
CACACGAXGTXGTCAAGGATGGXXXXGGGACTAGAGGCTCAGTGGTGGGAGAGATCCCTGCAGAAXCCACCAACC 
AGAACGTGGTTTGCCTGAGGCTGTAACTGAGAGAAAGATTCTGGGGCTGTCTXATGAAAAXAXAGACATXCXCAC 
AXAAGCCCAGTXCATCACCAXXTCCXCCTTTACCXXTCAGTGCAGTTXCTTTTCACATTAGGCTGTXGGTTCA2^ 
CXXXTGGGAGCACGGACTGTCAGXXCTCTGGGAAGXGGTCAGCGCATCCTGCAGGGCXTCXCCXCCXCTGTCTTX 
TGGAGAACCAGGGCXCXXCXCAGGGGCXCXAGGGACXGCCAGGCXGXXXCAGCCAGGAAGGCCAAAATCAAGAGT 
GAGATGTAGAAAGXXGXAAAAXAGAAAAAGXGGAGXXGGTGAAXCGGTTGTTCTTTCCTCACATTTGGATGATXG 
XCATAAGGTXTXTAGCATGXTCCTCCXXTTCTTCACCCTCCCCXTXGXTCTXCXATXAAXCAAGAGAAACXXCAA 
AGTXAATGGGAXGGXCGGATCXCACAGGCTGAGAACTCGTTCACCTCCAAGCATTTCATGAAAAAGCXGCXTCTX 
ATTAATCAXACAAACTCTCACCATGATGTGAAGAGTXXCACAAATCTXTCAAAAXAA2\AAGTAATGACTTAGAAA 
CXGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 91 

MRAWIFFLLCLAGRALAAPQQEALPDETEWEETVAEVTEVSVGANPVQVEVGEFDDGAEETEEEWAENPCQNH 
HCKHGKVCELDENNTPMCVCQDPTSCPAPIGEFEKVCSNDNKTFDSSCHFFATKCTLEGTKKGHKLHLDYIGPCK 

YIPPCLDSELTEFPLRMRDWLKNVLVTLYERDEDNNLLTEKQKLRVKKIHENEKRLEAGDHPVELLARDFEKNYN 
MYIFPVHWQFGQLDQHPIDGYLSHTELAPLRAPLIPMEHCTTRFFETCDLDNDKYIALDEWAGCFGIKQKDIDKD 
LVI 
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FIGURE 92 

CCG6CCCGCGCCCCGCAGGCCGCCCGCCGCCCGCGCCGCC ATGG GAGTGGAGGGCTGCACCAAGTGCATCAAGTA 
CCTGCTCTTCGTCTTCAATTTCGTCTTCTGGCTGGCTGGAGGCGTGATCCTGGGTGTGGCCCTGTGGCTCCGCCA 
TGACCCGCAGACCACCAACCTCCTGTATCTGGAGCTGGGAGACAAGCCCGCGCCCAACACCTTCTATGTAGGCAT 
CTACATCCTCATCGCTGTGGGCGCTGTCATGATGXTCGTTGGCTTCCTGGGCTGCTACGGGGCCATCCAGGAATC 
CCAGTGCCTGCTGGGGACGTTCTTCACCTGCCTGGTCATCCTGTTTGCCTGTGAGGTGGCCGCCGGCATCTGGGG 
CTTTGTCAACAAGGACCAGATCGCCAAGGATGTGAAGCAGTTCTATGACCAGGCCCTACAGCAGGCCGTGGTGGA 
TGATGACGCCAACAACGCCAAGGCTGTGGTGAAGACCTTCCACGAGACGCTTGACTGCTGTGGCTCCAGCACACT 
GACTGCTTTGACCACCTCAGTGCTCAAGAACAATTTGTGTCCCTCGGGCAGCAACATCATCAGCAACCTCTTCAA 
GGAGGACTGCCACCAGAAGATCGATGACCTCTTCTCCGGGAAGCTGTACCTCATCGGCATTGCTGCCATCGTGGT 
CGCTGTGATCATGATCTTCGAGATGATCCTGAGCATGGTGCTGTGCTGTGGCATCCGGAACAGCTCCGTGTACTG 
AGGCCCCGCAGCTCTGGCCACAGGGACCTCTGCAGTGCCCCCTAAGTGACCCGGACACTTCCGAGGGGGCCATCA 
CCGCCTGTGTATATAACGTTTCCGGTATTACTCTGCTACACGTAGCCTTTTTACTTXTGGGGTTTTGTTTTTGTT 
CTGAACTTTCCTGTTACCTTTTCAGGGCTGACGTCACATGTAGGXGGCGXGTATGAGTGGAGACGGGCCTGGGTC 
XXGGGGACXGGAGGGCAGGGGXCCXXCXGCCCXGGGGXCCCAGGGXGCXCTGCCTGCTCAGCCAGGCCXCTCCTG 
GGAGCCACTCGCCCAGAGACTCAGCTTGGCCAACXTGGGGGGCTGTGTCCACCCAGCCCGCCCGTCCTGTGGGCT 
GCACAGCTCACCTTGTTCCCTCCXGCCCCGGTTCGAGAGCCGAGXCXGXGGGCACXCXCXGCCXXCATGCACCXG 
XCCTTTCTAACACGXCGCCTXCAACXGXAAXCACAACATCCXGACXCCGTCATTTAAT2\AAGAAGGAACATCAGG 
CATGCTAAAAAAAAAAAAAAAAiyyyuyy^AAAAAAAAA^ 
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FIGURE 93 

MGVEGCTKCIKYLLFVFNFVFWLAGGVILGVALWLRHDPQTTNLLYLELGDKPAPNTFYVGIYILIAVGAVMMFV 
GFLGCYGAIQESQCLLGTFFTCLVILFACEVAAGIWGFVNKDQIAKDVKQFYDQALQQAWDDDANNAKAWKTF 
HETLDCCGSSTLTALTTSVLKNNLCPSGSNIISNLFKEDCHQKIDDLFSGKLYLIGIAAIWAVIMIFEMILSMV 
LCCGIRNSSVY 
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FIGURE 94A 

GCTAAGTTAGCTTTTCAACTGGCACTGTATGGCAGCATTTTTGGTATGGTTAGCGTGGCACATGGCGAAACATAA 
AGCATTTTACTGTACAGGTAAGGAATGTGCCATGTGTTTTACCTATCTCTCTTTCTCTCTCACTCCCATGCACAC 
ATCCTGTGTGTATTCAGAGACCTTCAGAAACATTCATATTCATTTTCATGAGTCAGCAAAAGCCCTATGCTTGAT 
TCCAACAGAATATTTCCTTTACATACTTTCTTCTCTTAATTTTTACAAAATTGTATGGTAGGTGTAAAAGAAAAT 
CATAGTAACTGTACCATATTATTAACCCCTAAATCAAACXTTTTTTGTCTTGTGTATCTTGATTTTTCTGTGTGC 
TTTATAGTGAAGCAGCCGACACGAGTCGTTGTTCATAAAACAGCTTTTGAAAGTTGAGAGCACACCCCTGGAGAA 
CCGACTGTGCTTGCTTACGTTTGGTTCATGACXTAAAAATCGAGTACAGGTGATGAAATCTTGGCAGTGTTAACA 
AAAAAGTAGTGTGTATTGTGCTATTTTTTTTTTACTCTAGAAACTTAACCATTTGTAGAGAAAAAGGAAACAAAT 
TTTCACACATTGAAGTTCATTCTGACATAAAATTA ATGA TAAATAATCATAGAAATCAAGCTTTGTATTTTAGCG 
AACATAAGTACTTTCAACAAACTCAGGTGGTGTATCAGGGAGACATTTTCTGGGTGTTTTTGTGTGTTTTCTGTC 
TTAGAAAAGAATGTGTTCTAATGCAAGGATGTTTCTCTGCAGGAGTTATTCCTGATGAAGCTAAAGCTTTGTCTC 
TGTTGGCACCAGCTAATGCAGTGGCAGGTCTTCTGCCTGGTGGTGGACTCCTGCCTACTCCTAACCCACTTACCC 
AGATTGGCGCTGTTCCACTGGCTGCTTTGGGGGCTCCTACTCTTGATCCXGCCCTTGCTGCACTTGGGCTTCCTG 
GAGCAAACTTGAACTCTCAGTCTCTTGCTGCAGATCAGTTGCTGAAGCTTATGAGTACTGTTGATCCCAAGTTGA 
ATCATGTAGCTGCTGGTCTCGTTTCACCAAGTCTGAAATCGGATACCTCTAGTAAAGAAATAGAGGAAGCTATGA 
AAAGAGTACGAGAAGCACAGTCCCTAATTTCTGCTGCTATAGAACCAGATAAGAAAGAAGAAAAAAGAAGGCATT 
CAAGATCAAGATCACGTTCTAGGAGGAGGAGGACTCCCTCATCTTCTAGACACAGGCGGTCAAGAAGCAGATCGA 
GACGGCGGTCACATTCTAAGTCTAGGAGTCGGCGACGATCCAAAAGCCCAAGGCGGAGAAGATCTCATTCCAGAG 
AAAGAGGTAGAAGGTCAAGGAGCACATCAAAAACAAGAGACAAAAAGAAAGAAGACAAAGAAAAGAAACGTTCTA 
AAACACCACCAAAAAGTTACAGCACAGCCAGACGTTCTAGAAGTGCAAGCAGAGTATATTTGAAGTAACATGGAA 
TTCAAGAGAGAGAGAGACGACGACGAAGAAGCAGGAGTGGCACAAGATCTCCTAAAAAGCCXCGGTCTCCTAAAA 
GAAAATTGTCCCGCTCACCATCCCCTAGGAGACATAAAAAGGAGAAGAAGAAAGATAAAGACAAAGAAAGAAGTA 
GGGATGAAAGAGAACGATCAACAAGCAAGAAGAAGAAGAGTAAAGATAAGGAAAAGGACCGGGAAAGAAAATCAG 
AGAGTGATAAAGATGTAAAACAGGTTACACGGGATTATGATGAAGAGGAACAGGGGTATGACAGTGAGAAAGAGA 
AAAAAGAAGAGAAGAAACCAATAGAAACAGGTTCCCCTAAAACAAAGGAATGTTCTGTGGAAAAGGGAACTGGTG 
ATTCACTAAGAGAATCCAAAGTGAATGGGGATGATCATCATGAAGAAGACATGGATATGAGTGACTGAATATTGC 
CTCTGAGGGAGTCCAACTGTATACCTGCATCAGTGTCATTCCTTXGTGTGATTTCTTAATGCTGTATTTGTTCAT 
CTCAAACCTAGATGTATACAGCTCTGAGTTATAAATGGTTATAAAGCTCCTGTTACTCATATTAGTTATTTACAT 
CAAAAAGCTTTTAGAAAATGGTACGAGGTAACCAATTCTTGTCATGGXGAAATCTGATTGAGXAACCAAGCAGTT 
XXACXATTCTGGXGCXGCTTCAXAACAAAAATGAAAAGCTGCATGCAXCXACAGCAGGCAXGGATTGXTXAXGTC 
GXATGATAXCCTTTATTAAGXAAGTTCACTXAXAGXAXXTCXAXAATTXGAXXCATXGCCGTAAXAGAGCCATGX 
AGGAAATGCACXGATXGCATGXXAXXGXGGCAAGAAXATCCTAAAXGXCATTAAAAXCCXCCAACATGATGGAXC 
TACXTATGGTCXXGTTTGXXGACAXGACAAATTAACAXXCTXAXAGXXACAXCTGGAAAXGAGCATXTGAAAXAG 
ATAATCCTTTAAGCCTXGTGGCAAAATTTTTGTGGCTTTTGTTTAACTTTGAAAGGXTATTATGCACXAACCTTX 
TXXGGXGGCXAAXXAGGGXXXAAATACAGAAACAAGATTTCAAAXAAAACXGTCXXXGGCAGTGAGXAAATAGCA 
XATXXXGAAGXAGAGXTGTATACXXXXXCAXAAGAXGXXXGGGAATXTXXXXCCTGAAGXAAXAAXTXAXXCCAC 
AXCTACAXCAGTGAAAGCTATCXACCXAXCCTGAGTCXAXCXXAAAGGAAAAAAAGAAAAAAACCXTAXCXCTXG 
CCCXTATTTTGAAXTTTCCACXCTXTCATTAATXTGXXXXAAGCTCCGXGTXGGAAAAAAGGGGXAGXGCAXXTX 
AAATTGACCTXCAXACGCXXXXA2VAATAAGACAAAXCTACXXGAXAATGTACCTXXAXXTGATCTCAAGXXGTAT 
AAAACCAAXAAAXXXGXGTXACXGCAGXAGXAATCXXATGCACACGGXGATTTCATGXXAXAXATGCAAAGTAGG 
CAACTGTTTXCXTAGTTACAGAAGTTTCAAGCXXCACXXXTGTGCAGTAGAAACAAAAGTAGGCXACAGTCTGXG 
CCAXGXXGATGXACAGTTTCXGAAAXTGXXXXACAAGACXXXGATAAXAAAACCCTXAAACTXATGTXCATGXXC 
CXGXAAAACCGXATTTGXAXXXAXXXACGCTACXGAAXGTATGACAXXXACCXCAXTCATXTXACAAAXXCXTTC 
CCXTTCTGTCCACAXATTXCAGTATAGXAAAAAGAGGAAGTCTATCACTGXAGTGATAATXGCCATCAAAATTGX 
CAAAAATGAXXXAAXTXCXATCCAAAATAGXCCXXXTCXXAGCXXAGXAXCAXXXXAXXGCXTAXXXXTTGXGXG 
GGAAXGGGGTTGGAXAAAGC2^XGAACTTXAGXATAAACAAAXCCCACCXAXATCXAGCAAAXXXATAXXTXCGG 
XGAAAXACAGAXAXXTGCCTTTCTGGAGXAGTAXAGAAGCXGXCAAXATGTAXCTACTGXACAGTACXAAAXAGX 
AXTCAXXTAXGAAAXGAGXAGXGXXXGGGXGGCXGGGGXXAAGGGAAAAXGAGACXXGGAAXTGXAGCTXXXAXC 



wo 2004/041170 



PCT/US2003/034312 



107/2825 

FIGURE 94B 

CAAGTTTTGAGTATAAATAGGGTTTTGTTTTGTTTTTTTTAACCTAAAAACTGAAATGCCATATAGAAAAACAGC 
ATTGTTTTTACAGTTTGTAGTAAGTAACTTTTTAAAGATTTTATCAAAAAGAATTTTGTCTATAGTGAGTAAAAG 
AAGTTCTAATAATGGTCCTAATCACTGCATTTTTAAAAAACAAAGTTCAACACAAATGACATTTGTTTTAAACTT 
TAGTAGATAAAAGGTGAACCATGTGACATGGGCATTTTTGTAAGTCAAAAACAAATXTCACATATGGTAAACCTA 
ATATTCACAGTGTGTTCCCTCACTTGTAATCTCTGAATACAAATATACTAGCTTTTCTAAAGGGAATCATTTTTT 
TAAAAGTAGTGCCACTGACAAGATGCTACAGTGAAGATTATCCATTCTTAGGATATTTATTTTCAGTGAACATTT 
TCTGCACAAAGGTAGTGTTGCACTGGGACACAAGCCTTTTAACAGATAACCAGTTGAAATCAAACACTGCCTCCA 
CACCGAGTTCTGTTGTGTATTTGATAGTAAATTGATTTAAAAATAAAAGTGGTTTTTGTTAGAAA 
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FIGURE 95 

MINNHRNQALYFSEHKYFQQTQVVYQGDIFWVFLCVFCLRKECVLMQGCFSAGVIPDEAKALSLLAPANAVAGLL 
PGGGLLPTPNPLTQIGAVPLAALGAPTLDPALAALGLPGANLNSQSLAADQLLKLMSTVDPKLNHVAAGLVSPSL 
KSDTSSKEIEEAMKRVREAQSLISAAIEPDKKEEKRRHSRSRSRSRRRRTPSSSRHRRSRSRSRRRSHSKSRSRR 
RSKSPRRRRSHSRERGRRSRSTSKTRDKKKEDKEKKRSKTPPKSYSTARRSRSASRVYLK 
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FIGURE 96 

GGCACGAGGGGAGCGCTTGTTTGCTGCCTCGTACTCCTCCATTTATCCGCC ATGS VTAAGTGCCAGCCGAGCTGCA 
GCAGCCCGTCTCGTGGGCGCCGCAGCCTCCCGGGGCCCTACGGCCGCCCGCCACCAGGATAGCTGGAATGGCCTT 
AGTCATGAGGCTTTTAGACTTGTTTCAAGGCGGGATTATGCATCAGAAGCAATCAAGGGAGCAGTTGTTGGTATT 
GATTTGGGTACTACCAACTCCTGCGTGGCAGTTATGGAAGGTAAACGAGCAAAGGTGCTGGAGAATGCCGAAGGX 
GCCAGAACCACCCCTTCAGTTGTGGCCTTXACAGCAGATGGTGAGCGACTTGTTGGAATGCCGGCCAAGCGACAG 
GCTGTCACCAACCCAAACAATACATTTTATGCTACCAAGCGTCTCATTGGCCGGCGATATGATGATCCTGAAGTA 
CAGAAAGACATTAAAAATGTTCCCTTTAAAATTGTCCGTGCCTCCAATGGTGATGCCTGGGTXGAGGCTCATGGG 
AAATTGTATTCTCCGAGTCAGATTGGAGCATTTGTGTTGATGAAGATGAAAGAGACTGCAGAAAATTACTTGGGG 
CGCACAGCAAAAAATGCTGTGATCACAGTCCCAGCTTATTTCAATGACTCGCAGAGACAGGCCACTAAAGATGCT 
GGCCAGATATCTGGACTGAATGTGCTTCGGGTGATTAATGAGCCCACAGCXGCTGCTCTTGCCTATGGTCTAGAC 
AAATCAGAAGACAAAGXCATTGCTGTATATGATTXAGGTGGXGGAACTTTTGATATTTCTAXCCXGGAAAXTCAG 
AAAGGAGXATXTGAGGTGAAATCCACAAATGGGGAXACCTTCTXAGGXGGGGAAGACTTXGACCAGGCCTTGCTA 
CGGCACAXXGXGAAGGAGTXCAAGAGAGAGACAGGGGXXGAXTXGACXAAAGACAACAXGGCACXTCAGAGGGXA 
CGGGAAGCXGCTGAAAAGGCXAAGTGTGAACTCXCCXCAXCXGTGCAGACTGACATCAATXXGCCCXATCTTACA 
ATGGATTCTTCTGGACCCAAGCATTTGAATATGAAGTTGACCCGTGCTCAATTTGAAGGGATTGTCACTGATCTA 
AXCAGAAGGACXAXCGCTCCATGCCAAAAAGCXAXGCAAGATGCAGAAGTCAGCAAGAGXGACAXAGGAGAAGXG 
ATXCXXGXGGGXGGCAXGACXAGGATGCCCAAGGTXCAGCAGACTGTACAGGATCXTXXXGGCAGAGCCCCAAGX 
AISlAGCTGTCAAXCCXGAXGAGGCTGTGGCCAXXGGAGCTGCCAXXCAGGGAGGTGTGTXGGCCGGCGATGTCACG 

gaxgxgctgctccttgatgxcactccccxgxcxctgggxattgaaacxcxaggaggtgtctttaccaaacxxaxx 
aaxaggaataccacxattccaaccaagaagagccaggtattcxcxactgccgcxgatggtcaaacgcaagxggaa 
axtaaagtgxgxcagggxgaaagagagaxggctggagacaacaaacxccttggacagxxxacxxxgaxtggaaxt 
ccaccagcccctcgtggagttccxcagattgaagttacatttgacatxgatgccaatgggatagtacaxgtttct 
gcxaaagataaaggcacaggacgtgagcagcagattgtaatccagtctxctggxggattaagcaaagaxgatatt 
gaaaaxaxggxxaaaaaxgcagagaaataxgcxgaagaagaccggcgaaagaaggaacgagttgaagcagtxaax 

ATGGCXGAAGGAATCATTCACGACACAGAAACCAAGAXGGAAGAATTCAAGGACCAATTACCTGCTGATGAGTGC 

aacaagctgaaagaagagatxxccaaaaxgagggagctccxggcxagaaaagacagcgaaacaggagaaaataxx 

agacaggcagcatcctcxcttcagcaggcaxcaxxgaagctgtxcgaaaxggcatacaaaaagaxggcaxcxgag 

cgagaaggctctggaagttctggcactggggaacaaaaggaagatcaaaaggaggaaaaacagtam 

aaatttxgaagccagaaggacaacatatgaagctxaggagxgaagagacttcctgagcagaaaxgggcgaacttc 

agxcxttxtacxgxgxxtttgcagtatxctaxaxaxaaxxtcctxaatttgxaaaxxxagtgaccatxagcxagx 

GAXCATTTAATGGACAGTGATTCXAACAGTATAAAGTTCACAATAXTCTATGTCCCTAGCCTGXCATTTTTCAGC 
XGCATGXAAAAGGAGGTAGGATGAAXTGATCAXXAXAAAGAXXTAACTATXXXAXGCTGAAGTGACCATATXXTC 
AAGGGGTGAAACCAXCXCGCACACAGCAATGAAGGXAGXCAXCCAXAGACXXGAAAXGAGACCACAXATGGGGAX 
GAGAXCCTTCTAGTTAGCCTAGXACXGCTGTACTGGCCTGTATGTACAXGGGGTCCXTCAACXGAGGCCTTGCAA 
GTCAAGCXGGCXGXGCCATGTXXGXAGAXGGGGCAGAGGAATCTAGAACAATGGGAAACTTAGCTAXXXATATTA 
GGXACAGCTATTAAAACAAGGTAGGAAXGAGGCXAGACCXXXAACTTCCCXAAGGCAXACTXXTCXAGCTACCXX 
CXGCCCTGTGTCTGGCACCTACATCCTXGATGATTGTTCXCXXACCCATTCTGGAATTTTXXXXXXXTXTAAATA 
AAXACAGAAAGCAXCXXGAAAAAAAAAAAAAAAAAA 
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FIGURE 97 

MISASRAAAARLVGAAASRGPTAARHQDSWNGLSHEAFRLVSRRDYASEAIKGAWGIDLGTTNSCVAVMEGKRA 
KVLENAEGARTTPSWAFTADGERLVGMPAKRQAVTNPNNTFYATKRLIGRRYDDPEVQKDIKNVPFKIVRASNG 
DAWVEAHGKLYSPSQIGAFVLMKMKETAENYLGRTAKNAVITVPAYFNDSQRQATKDAGQISGLNVLRVINEPTA 
AALAYGLDKSEDKVIAVYDLGGGTFDISILEIQKGVFEVKSTNGDTFLGGEDFDQALLRHIVKEFKRETGVDLTK 
DNMALQRVREAAEKAKCELSSSVQTDINLPYLTMDSSGPKHLNMKLTRAQFEGIVTDLIRRTIAPCQKAMQDAEV 
SKSDIGEVILVGGMTRMPKVQQTVQDLFGRAPSKAVNPDEAVAIGAAIQGGVLAGDVTDVLLLDVTPLSLGIETL 
GGVFTKLINRNTTIPTKKSQVFSTAADGQTQVEIKVCQGEREiyiAGDNKLLGQFTLIGIPPAPRGVPQIEVTFDID 
ANGIVHVSAKDKGTGREQQIVIQSSGGLSKDDIENM^NAEKYAEEDRRKKERVEAVNMAEGIIHDTETKMEEFK 
DQLPADECNKLKEEISKMRELLARKDSETGENIRQAASSLQQASLKLFEMAYKKMASEREGSGSSGTGEQKEDQK 
EEKQ 
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FIGURE 98 

GGCACGAGGGGAGCGCTTGTTTGCTGCCTCGTACTCCTCCATTTATCCGCCATGATAAGTGCCAGCCGAGCTGCA 
GCAGCCCGTCTCGTGGGCGCCGCAGCCTCCCGGGGCCCTACGGCCGCCCGCCACCAGGATAGCTGGAATGGCCTT 
AGTCATGAGGCTTTTAGACTTGTTTCAAGGCGGGATTATGCATCAGAAGCAATCAAGGGAGCAGTTGTTGGTATT 
GATTTGGGTACTACCAACTCCTGCGTGGCAGTTATGGAAGGTAAACGAGCAAAGGTGCTGGAGAATGCCGAAGGT 
GCCAGAACCACCCCTTCAGTTGTGGCCTTTACAGCAGATGGTGAGCGACTTGTTGGAATGCCGGCCAAGCGACAG 
GCTGTCACCAACCCAAACAATACATTTTATGCTACCAAGCGTCTCATTGGCCGGCGATATGATGATCCTGAAGTA 
CAGAAAGACATTAAAAATGTTCCCTTTAAAATTGTCCGTGCCTCCAATGGTGATGCCTGGGTTGAGGCTCATGGG 
AAATTGTATTCTCCGAGTCAGATTGGAGCATTTGTGTTGATGAAGATGAAAGAGACTGCAGAAAATTACTTGGGG 
CGCACAGCAAAAAATGCXGTGATCACAGTCCCAGCTTATTTCAATGACTCGCAGAGACAGGCCACTAAAGATGCT 
GGCCAGATATCTGGACXGAATGTGCTTCGGGTGATTAATGAGCCCACAGCTGCTGCTCTTGCCTATGGTCTAGAC 
AAATCAGAAGACAAAGTCATTGCTGTATATGATTTAGGTGGTGGAACTTTTGATATTTCTATCCTGGAAATTCAG 
AAAGGAGTATTTGAGGTGAAATCCACAAATGGGGATACCTTCTTAGGTGGGGAAGACTTTGACCAGGCCTTGCTA 
CGGCACATXGTGAAGGAGTTCAAGAGAGAGACAGGGGXXGATTTGACTAAAGACAACATGGCACTTCAGAGGGTA 
CGGGAAGCTGCTGAAAAGGCTAAGXGTGAACTCTCCXCATCTGTGCAGACTGACATCAATXTGCCCTATCTTACA 
AXGGATTCTXCTGGACCCAAGCATTTGAAXATGAAGTTGACCCGTGCTCAATTTGAAGGGATTGTCACXGAXCXA 
ATCAGAAGGACXAXCGCXCCAXGCCAAAAAGCXATGCAAGAXGCAGAAGTCAGCAAGAGTGACAXAGGAGAAGTG 
ATTCXTGTGGGTGGCATGACXAGGATGCCCAAGGXXCAGCAGACTGTACAGGAXCXXXXXGGCAGAGCCCCAAGT 
AAAGCTGTCAATCCTGAXGAGGCTGTGGCCATTGGAGCTGCCATTCAGGGAGGXGTGTXGGCCGGCGATGXCACG 
GATGXGCTGCXCCTTGATGTCACXCCCCXGTCTCTGGGXAXXGAAACXCXAGGAGGTGTCXXTACCAAACTTATT 
AAXAGGAATACCACTATTCCAACCAAGAAGAGCCAGGXATTCXCXACXGCCGCTGAXGGXCAAACGCAAGTGGAA 
AXTAAAGTGXGXCAGGGXGAAAGAGAGATGGCXGGAGACAACAAACTCCTTGGACAGTTTACTTTGAXTGGAATT 
CCACCAGCCCCTCGTGGAGTTCCXCAGATTGAAGTTACAXTXGACAXXGATGCCAAXGGGATAGTACATGTTTCX 

gctaaagaxaaaggcacaggacgxgagcagcagattgxaaxccagtcxtctggtggatxaagcaaagatgaxaxx 
gaaaataxggtxaaaaatgcagagaaataxgctgaagaagaccggcgaaagaaggaacgagttgaagcagttaat 
atggctgaaggaatcattcacgacacagaaaccaagatggaagaattcaaggaccaattacctgctgatgagxgc 
aacaagcxgaaagaagagaxxxccaaaatgagggagcxccxggctagAaaagacagcgaaacaggagaaaaxaxx 
agacaggcagcatcctctcttcagcaggcaxcattgaagctgttcgaaatggcatacaaaaagaxggcatctgag 
cgagaaggcxctggaagttcxggcactggggaacaaaaggaagaxcaaaaggaggaaaaacagtaaxaatagcag 
aaaxxttgaagccagaaggacaacatatgaagcxxaggagtgaagagactxccxgagcagaaatgggcgaacttc 
agxctttttactgtgxxtttgcagtattctataxataatttccttaaxxtgxaaatttagtgaccatxagctagt 

GATCATXTAATGGACAGXGAXTCXAACAGTATAAAGTTCACAAXATTCTATGXCCCTAGCCTGTCATTTTTCAGC 

xgcaxgtaaaaggaggtaggatgaattgatcaxxataaagaxxxaacxaxxxxaxgcxgaagxgaccaxattttc 

AAGGGGTGAAACCAXCTCGCACACAGCAATGAAGGTAGTCATCCATAGACXXGAAATGAGACCACATAXGGGGAT 

gagatccttctagttagcctagxacxgctgtactggcctgxaxgxacatggggtccxtcaacxgaggcctxgcaa 

GXCAAGCTGGCXGTGCCAXGTXXGXAGAXGGGGCAGAGGAAXCXAGAACAAXGGGAAACXXAGCTATXXATAXXA 

ggtacagctattaaaacaaggtaggaatgaggcxagacctttaacttccctaaggcatacttttctagcxacctt 
ctgcccxgxgtctggcaccxacaxccxtgatgattgttctcttacccattctggaaxxxtttttttttttaaaxa 
aatacagaaagcaxcxxgaaaaaaaaaaaaaaaaaa 
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FIGURE 99 

MISASRAAAARLVGAAASRGPTAARHQDSWNGLSHEAFRLVSRRDYASEAIKGAWGIDLGTTNSCVAVMEGKRA 

KVLENAEGARTTPSWAFTADGERLVGMPAKRQAVTNPNNTFYATKRLIGRRYDDPEVQKDIKNVPFKIVRASNG 

DAWVEAHGKLYSPSQIGAFVLMKMKETAENYLGRTAKNAVITVPAYFNDSQRQATKDAGQISGLNVLRVINEPTA 

AALAYGLDKSEDKVIAVYDLGGGTFDISILEIQKGVFEVKSTNGDTFLGGEDFDQALLRHIVKEFKRETGVDLTK 

DNMALQRVREAAEKAKCELSSSVQTDINLPYLTMDSSGPKHLNMKLTRAQFEGIVTDLIRRTIAPCQKAMQDAEV 

SKSDIGEVILVGGMTRMPKVQQTVQDLFGRAPSKAVNPDEAVAIGAAIQGGVLAGDVTDVLLLDVTPLSLGIETL 

GGVFTKLINRNTTIPTKKSQVFSTAADGQTQVEIKVCQGEREMAGDNKLLGQFTLIGIPPAPRGVPQIEVTFDID 

ANGIVHVSAKDKGTGREQQIVIQSSGGLSKDDIENMVTCNAEKYAEEDRRKKERVEAVNMAEGIIHDTET 

DQLPADECNKLKEEISKMRELLARKDSETGENIRQAASSLQQASLKLFEMAYKKMASEREGSGSSGTGEQKEDQK 

EEKQ 
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FIGURE 100 

GCCGTGTCGCCACC ATGG CTCCGCACCGCCCCGCGCCCGCGCTGCTTTGCGCGCTGTCCCTGGCGCTGTGCGCGC 
TGTCGCTGCCCGTCCGCGCGGCCACTGCGTCGCGGGGGGCGTCCCAGGCGGGGGCGCCCCAGGGGCGGGTGCCCG 
AGGCGCGGCCCAACAGCATGGTGGTGGAACACCCCGAGTTCCTCAAGGCAGGGAAGGAGCCTGGCCTGCAGATCT 
GGCGTGTGGAGAAGTTCGATCTGGTGCCCGTGCCCACCAACCTTTATGGAGACTTCTTCACGGGCGACGCCTACG 
TCATCCTGAAGACAGTGCAGCTGAGGAACGGAAATCTGCAGTATGACCTCCACTACTGGCTGGGCAATGAGTGCA 
GCCAGGATGAGAGCGGGGCGGCCGCCATCTTTACCGTGCAGCTGGATGACTACCTGAACGGCCGGGCCGTGCAGC 
ACCGTGAGGTCCAGGGCTTCGAGTCGGCCACCTTCCTAGGCTACTTCAAGTCTGGCCTGAAGTACAAGAAAGGAG 
GTGTGGCATCAGGATTCAAGCACGTGGTACCCAACGAGGTGGTGGTGCAGAGACTCTTCCAGGTCAAAGGGCGGC 
GTGTGGTCCGTGCCACCGAGGTACCTGTGTCCTGGGAGAGCTTCAACAATGGCGACTGCTTCATCCTGGACCTGG 
GCAACAACATCCACCAGTGGTGTGGTTCCAACAGCAATCGGTATGAAAGACTGAAGGCCACACAGGTGTCCAAGG 
GCATCCGGGACAACGAGCGGAGTGGCCGGGCCCGAGTGCACGTGTCTGAGGAGGGCACTGAGCCCGAGGCGATGC 
TCCAGGTGCTGGGCCCCAAGCCGGCTCTGCCTGCAGGTACCGAGGACACCGCCAAGGAGGATGCGGCCAACCGCA 
AGCTGGCCAAGCTCTACAAGGTCTCCAATGGTGCAGGGACCATGTCCGTCTCCCTCGTGGCTGATGAGAACCCCT 
TCGCCCAGGGGGCCCTGAAGTCAGAGGACTGCTTCATCCTGGACCACGGCAAAGATGGGAAAATCTTTGTCTGGA 
AAGGCAAGCAGGCAAACACGGAGGAGAGGAAGGCTGCCCTCAAAACAGCCTCTGACTTCATCACCAAGATGGACT 
ACCCCAAGCAGACTCAGGTCTCGGTCCTTCCTGAGGGCGGTGAGACCCCACTGTTCAAGCAGTTCTTCAAGAACT 
GGCGGGACCCAGACCAGACAGATGGCCTGGGCTTGTCCTACCTTTCCAGCCATATCGCCAACGTGGAGCGGGTGC 
CCTTCGACGCCGCCACCCTGCACACCTCCACTGCCATGGCCGCCCAGCACGGCATGGATGACGATGGCACAGGCC 
AGAAACAGATCTGGAGAATCGAAGGTTCCAACAAGGTGCCCGTGGACCCTGCCACATATGGACAGTTCTATGGAG 
GCGACAGCTACATCATTCTGTACAACTACCGCCATGGTGGCCGCCAGGGGCAGATAATCTATAACTGGCAGGGTG 
CCCAGTCTACCCAGGATGAGGTCGCTGCATCTGCCATCCTGACTGCTCAGCTGGATGAGGAGCTGGGAGGTACCC 
CTGTCCAGAGCCGTGTGGTCCAAGGCAAGGAGCCCGCCCACCTCATGAGCCTGTTTGGTGGGAAGCCCATGATCA 
TCTACAAGGGCGGCACCTCCCGCGAGGGCGGGCAGACAGCCCCTGCCAGCACCCGCCTCTTCCAGGTCCGCGCCA 
ACAGCGCTGGAGCCACCCGGGCTGTTGAGGTATTGCCTAAGGCTGGTGCACTGAACTCCAACGATGCCTTTGTTC 
TGAAAACCCCCTCAGCCGCCTACCTGTGGGTGGGTACAGGAGCCAGCGAGGCAGAGAAGACGGGGGCCCAGGAGC 
TGCTCAGGGTGCTGCGGGCCCAACCTGTGCAGGTGGCAGAAGGCAGCGAGCCAGATGGCTTCTGGGAGGCCCTGG 
GCGGGAAGGCTGCCTACCGCACATCCCCACGGCTGAAGGACAAGAAGATGGATGCCCATCCTCCTCGCCTCTTTG 
CCTGCTCCAACAAGATTGGACGTTTTGTGATCGAAGAGGTTCCTGGTGAGCTCATGCAGGAAGACCTGGCAACGG 
ATGACGTCATGCTTCTGGACACCTGGGACCAGGTCTTTGTCTGGGTTGGAAAGGATTCTCAAGAAGAAGAAAAGA 
CAGAAGCCTTGACTTCTGCTAAGCGGTACATCGAGACGGACCCAGCCAATCGGGATCGGCGGACGCCCATCACCG 
TGGTGAAGCAAGGCTTTGAGCCTCCCTCCTTTGTGGGCTGGTTCCTTGGCTGGGATGATGATTACTGGTCTGTGG 
ACCCCTTGGACAGGGCCATGGCTGAGCTGGCTGCCTGAGGAGGGGCAGGGCCCACCCATGTCACCGGTCAGTGCC 
TTTTGGAACTGTCCTTCCCTCAAAGAGGCCTTAGAGCGAGCAGAGCAGCTCTGCTATGAGTGTGTGTGTGTGTGT 
GTGTTGTTTCTTTTTTTTTTTTTTACAGTATCCAAAAATAGCCCTGCAAAAATTCAGAGTCCTTGCAAAATTGTC 
TAAAATGTCAGTGTTTGGGAAATTAAATCCAATAAAAACATTTTGAAGTGTG 
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FIGURE 101 

MAPHRPAPALLCALSLALCALSLPVRAATASRGASQAGAPQGRVPEARPNSMWEHPEFLKAGKEPGLQIWRVEK 
FDLVPVPTNLYGDFFTGDAYVILKTVQLRNGNLQYDLHYWLGNECSQDESGAAAIFTVQLDDYLNGRAVQHREVQ 

GFESATFLGYFKSGLKYKKGGVASGFKHWPNEVWQRLFQVKGRRWRATEVPVSWESFNNGDCFILDLGNNIH 
QWCGSNSNRYERLKATQVSKGIRDNERSGRARVHVSEEGTEPEAMLQVLGPKPALPAGTEDTAKEDAANRKLAKL 
YKVSNGAGTMSVSLVADENPFAQGALKSEDCFILDHGKDGKIFVWKGKQANTEERKAALKTASDFITKMDYPKQT 
QVSVLPEGGETPLFKQFFKNWRDPDQTDGLGLSYLSSHIANVERVPFDAATLHTSTAMAAQHGMDDDGTGQKQIW 
RIEGSNKVPVDPATYGQFYGGDSYIILYNYRHGGRQGQIIYNWQGAQSTQDEVAASAILTAQLDEELGGTPVQSR 
WQGKEPAHLMSLFGGKPMIIYKGGTSREGGQTAPASTRLFQVRANSAGATRAVEVLPKAGALNSNDAFVLKTPS 
AAYLWVGTGASEAEKTGAQELLRVLRAQPVQVAEGSEPDGFWEALGGKAAYRTSPRLKDKKMDAHPPRLFACSNK 
IGRFVIEEVPGELMQEDLATDDVMLLDTWDQVFVWVGKDSQEEEKTEALTSAKRYIETDPANRDRRXPITWKQG 
FEPPSFVGWFLGWDDDYWSVDPLDRAMAELAA 
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FIGURE 102 

GTTTCTCTCCCTGCCCCCGCGACTTCGCGCAAGATCCGGGAAGGACACCCGAGGCCCCTGGGAGACCCTGGGGAG 
GTGAAAGTCAGAGAGCGAAGCGGGCCGTGGCCCCXAGGCCTGACCCCTCCCCGCGGGGTAAGGCGGGCACCCCGC 
GAGCGCAGGGGTCCTCTTACTGCTGATGGCACCCAGCTCTGGGCCCAGACGCCGCTCACCGTCCACCGCCGGTGC 
TGGGTAAA ATGT CGGTTCCAGGACCTTACCAGGCGGCCACTGGGCCTTCCTCAGCACCATCCGCACCTCCATCCT 
ATGAAGAGACAGTGGCTGTTAACAGTTATTACCCCACACCTCCAGCTCCCATGCCTGGGCCAACTACGGGGCTTG 
TGACGGGGCCTGATGGGAAGGGCATGAATCCTCCTTCGTATTATACCCAGCCAGCGCCCATCCCCAATAACAATC 
CAATTACCGXGCAGACGGTCTACGTGCAGCACCCCAXCACCTTTTTGGACCGCCCTATCCAAATGTGTTGTCCTT 
CCTGCAACAAGATGATCGTGAGTCAGCTGTCCTATAACGCCGGTGCTCTGACCTGGCTGTCCTGCGGGAGCCTGT 
GCCTGCTGGGGTGCATAGCGGGCTGCTGCTTCATCCCCTTCTGCGTGGATGCCCTGCAGGACGTGGACCATTACT 
GTCCCAACTGCAGAGCTCTCCTGGGCACCTACAAGCGTTTG TAGG ACTCAGCCAGACGTGGAGGGAGCCGGGTGC 
CGCAGGAAGTCCTTTCCACCTCTCATCCAGCTTCACGCCTGGTGGAGGTTCTGCCCTGGTGGTCTCACCTCTCCA 
GGGGGCCCACCTTCATGTCTTCTTTTGGGGGGAATACGTCGCAAAACTAACAAATCTCCAAACCCCAGAAATTGC 
TGCTTGGAGTCGTGCATAGGACTTGCAAAGACATTCCCCTXGAGTGTCAGTTCCACGGTTTCCTGCCTCCCTGAG 
ACCCTGAGTCCTGCCATCTAACXGXGAXCATTGCCCTATCCGAAXAXCXXCCTGTGATCTGCCAXCAGTGGCTCT 
XXTTTCCXGCXTCCATGGGCCXTTCXGGTGGCAGTCTCAAACTGAGAAGCCACAGTTGCCTXATTTTTGAGGCTG 
TTCTGCCCAGAGCXCGGCXGAACCAGCCXXXAGXGCCXACCAXXAXCTTATCCGXCXCXXCCCGXCCCXGAXGAC 
AAAGAXCXXGCCTTACAGACTTXACAGGCXTGGCXTXGAGAXXCXGXAACXGCAGACTXCATXAGCACACAGATX 
CACXXXAATTTCTTAATTTTXXXXXXAAATACAAGGAGGGGGCXAXXAACACCCAGTACAGACAXATCCACAAGG 
TCGTi\AATGCAXGCTAGAAAAAXAGGGCTGGAXCXXATCACXGCCCTGTCXCCCCXXGXXXCTCTGXGCCAGATC 
TXCAGTGCCCCXXXCCATACAGGGATTTXXTXCXCAXAGAGXAAXXATAXGAACAGXXTXXATGACCXCCXXXTG 
GXCXGAAATACXXXTGAACAGAAXXXCXXTTTXXTAAAAAAAAACAGAGATGGGGTCXXACXAXGXTGCCCAGGC 
XGGTGTCGAACXCCXGGGCXCAAGCGAXCCXXCXGCCXXGGCCTCCCGAAGXGCXGGGAXXGCAGGCAXAAGCXA 
CCAXGCXGGGCCXGAACAXAAXXXCAAGAGGAGGAXXXAXAAAACCAXXXXCXGXAAXCAAAXGAXXGGXGXCAX 
TXXCCCATTXGCCAAXGXAGTCTCACTTAAAAAAAAAAAAAAGAAAAAGAAAXGGAXAAXTTCATCXACXGCCXX 
XACXXGGGGXXAATGXGAXXCTTAAACACCXTCAXCAXGGAACTCTCAGAGTGGGGXCCGTTXXGGTXXCCXGGX 
GGXGGGXXXXGAAAGAXAAGGGAAAGCACAXXXXGAGCAXGXCXGGGXACCAXGGXGCGGAXGCXXGGGAACCAG 
AACTGXTTCAGAGGAAXCXAAAGXCXGAXXXXAGXXXXCAGAGACACAGCXXGXXGTAAAACATGAGAAGACAXG 
AXXXCXAGGACXCAAGCAGCAAGCCAGGATTCXAGGXXGGCXGCXGXGTCAXCTXXGAAGXCAAGACAAAGCXGG 
GCXCGACCXXCAAGGGXCCXCGXXXXGAXAAXACXXCAGAAXAGGGAACXCAXGXGAAXACXACXAXGTAGAAAX 
AAAACCXAGACCXTGAGCGAACAXCXGXATATTGGTXGAAAACGATAGTGGXAACCAXXGAXCCCCCTTCAXTTG 
AXGXXTGGAAAAXXCCAGTAATTAXCAXXXXXGCAACGAATATGGAXACCACAXAGTACXTXGGXGXXACCTGCT 
XXXGAAAAAXAAAGXCXXXGGXXCACCCGGXAAAAAAAAAAAA 



